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NOVEL PROBES FOR THE DETECTION OF MYCOBACTERIA 

The present invention relates to novel probes and to mixtures of such probes, in addition to 
the design, construction and use of such novel probes or a mixture thereof for detecting the 
presence of mycobacteria, which probes are capable of detecting the organisms in test 
samples, e.g. sputum, expectorates, aspirates, cerebrospinal fluid, urine, blood and tissue 
sections, food, soil and water. The invention relates in particular to novel probes and mixtures 
thereof for the detection of mycobacteria of the Mycobacterium tuberculosis Complex (MTC) 
and the detection of one or more mycobacteria other than mycobacteria of the Mycobacterium 
tuberculosis Complex. The invention further relates to diagnostic kits comprising one or more 
of such probes. 

BACKGROUND OF THE INVENTION 

Mycobacteria are slow growing, acid fast, aerobic bacilli. At least nineteen Mycobacterium 
species have been associated with disease in humans, among them M. tuberculosis, M. bovis, 
M. avium, M. intracellular and M. leprae. Species that are not normally pathogenic to healthy 
individuals may cause disease in immunocompromised individuals, e.g. those infected with 
HIV. M. kansasii and M. xenopi cause lung infection, and infection with M. marinum causes 
arthritis and osteomyelitis. Other clinically relevant mycobacteria are M. microti, M. 
paratubercuiosis, M. scrofulaceum, M. africanum M. gordonea, M. chelonei and M. fortuitum. 
The MTC group includes M. tuberculosis, M. bovis, M. kansasii and M. africanum. M. avium, 
M. intracellular, M. paratubercuiosis and M. lepraemurium are included in a group named 
Mycobacterium avium Complex. Classification and further description of the various 
mycobacteria can be found in e.g. Clinical Microbiology Reviews, 1-25 (January 1992) and 
Clinical Microbiology Reviews, 266-310 (July 1993). 

One very life-threatening and highly epidemic disease is tuberculosis caused by infection with 
mycobacteria of the MTC group, in particular M. tuberculosis. Tuberculosis is presently the 
predominant infectious cause of morbidity and mortality world-wide, and is estimated to kill 
about three million people annually. WHO estimates that the annual number of new cases of 
tuberculosis will increase from 7.5 million in 1990 to 10.2 million in 2000, an escalation that will 
result in approximately 90 million new cases during this decade. It is furthermore estimated 
that 30 million people will die from tuberculosis during the 1990s, which equals one quarter of 
preventable deaths among adults. 

The prevalence of tuberculosis has been very high in the poorer parts of the world such as 
Asia, Africa and South-America, but in recent years an increase has also been observed in 
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industrialised countries. This appears to be due to an interaction of various factors including 
i.a. patterns of migration, poorly organised tuberculosis programmes and nutrition problems. 
Furthermore, a serious threat will arise from the emergence of new strains that are multi-drug 
resistant. 

Considering the perspective and impact the disease has, the development of rapid, specific 
and preferably easy-performed and economic feasible diagnostic detection tests are of utmost 
importance and would be a very valuable too! in the fight against the spread of tuberculosis. 

Presently, the detection of mycobacteria by microscopy gives the more accurate diagnosis. 
The sample (e.g. an expectorate) is stained for the presence of acid-fast bacilli using Ziehl- 
Neelsen staining. However, Ziehl-Neelsen staining does not provide the necessary information 
about the type of infection, only whether acid fast bacilli are present in the sample. Ziehl- 
Neelsen positive samples may subsequently be cultured. Cultivation is sensitive, and it may 
be possible to detect 10-100 organisms per sample, but the result is not available before up to 
8 weeks of cultivation. Likewise, information of drug susceptibility is not available until after 1-3 
weeks of further testing. 

Since Ziehl-Neelsen staining cannot be used to determine whether the infection is caused by 
mycobacteria of the MTC group or mycobacteria other than mycobacteria of the MTC group, a 
positive staining leads frequently to very costly isolation of all patients as well as treatment 
with medicaments to which the patient may not even respond. Species identification is 
presently carried out following cultivation using traditional biochemical methods or probe 
hybridisation assays (e.g. AccuProbe by Gen-Probe Inc.). There is, therefore, an increasing 
need for means allowing a more rapid distinction between mycobacteria of the MTC group and 
mycobacteria other than those of the MTC group. 

Automated detection is rapidly becoming available for large scale testing for the presence of 
mycobacteria. Such systems include ESP Myco Culture System (Difco), MB/BacT (Organon 
Teknika) and MGIT (Becton Dickinson). These test methods are based on colorimetric or 
fluorometric detection of carbon dioxide or oxygen produced by mycobacterial metabolism. 

Some of the attempts to replace the methods based on cultivation rely on target amplification. 
One of such newly developed target amplification method is based on PCR. The principle of 
this reaction is, through amplification of specific nucleic acid sequences of the mycobacteria, 
to increase the copy number of the specific sequence to a level where it may be detectable in 
an early stage of the infection. In principle, the PCR reaction offers the possibility of detecting 
as few as one target sequence. In most cases, the DNA is extracted prior to carrying out the 
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PCR reaction. However, it has become clear that the target amplification test cannot replace 
culture test as only samples which are Ziehl-Neelsen positive give a satisfactory sensitivity. 

Furthermore, false negative results may be obtained due to the presence of inhibitors of the 
5 PCR reaction such as haemoglobin and other proteins. 

Another problem arises from cross-contamination of negative specimens with bacteria not 
present in the sample. This may cause problems in conventional bacteriological procedures 
and may lead to a positive PCR result. Contamination of reagents and specimens with 
10 amplified PCR products is yet another well-recognised problem when using a PCR-based test. 

Nucleic acid probes for detecting rRNA of mycobacteria have been described in for example 
US 5 547 842, EP-A 0 572 120 and US 5 422 242. 

15 SUMMARY OF THE INVENTION 

The present invention relates to novel peptide nucleic acid probes and to mixtures of such 
probes for detecting the presence of mycobacteria in a sample. In accordance with claim 1, 
the probes are directed to sequences in rRNA and genomic sequences corresponding to said 
20 rRNA (rDNA). rRNA is present in a high number of copies in each cell, and hence a well suited 
target for a sensitive test. The probes are, as defined in claim 2, suitably directed to target 
sequences in the 23S, 16S or 5S rRNA or DNA coding for said rRNA. 

Thus, in a first aspect, the invention features a hybridisation assay probe and a mixture of 
25 such probes for detecting the presence of mycobacteria in accordance with claim 1 and 2. 
Such probes should not to any significant degree cross react with nucleic acid from other 
organisms in the test sample under appropriate stringency conditions. Cross reactivity to 
organisms that are unlikely to appear in the sample may not be of importance. In in situ 
assays implying examination by microscopy, it is possible to distinguish mycobacteria from 
30 other bacteria based on evaluation of the morphology of the observed bacilli. 

In another aspect, the invention relates to novel peptide nucleic acid probes for detecting the 
presence of mycobacteria of the MTC group, in particular M. tuberculosis, and one or more 
mycobacteria other than mycobacteria of the MTC group, in particular mycobacteria of the M. 
35 avium Complex (claim 3). 

Claims 4 to 6 and 9 to 14 relate to probes for detecting the presence of mycobacteria of the 
MTC group. Claims 7 to 13 relate to probes for detecting one or more mycobacteria other than 
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mycobacteria of the MTC group. Claims 15 relates to a mixture of peptide nucleic acid probes 
according to claims 1 to 14. 

in a further aspect, as defined in claims 16 to 18, the invention relates to the use of the 
5 peptide nucleic acid probes and mixtures of such probes according to claims 1 to 15. 

The present invention also relates to a method for detecting the presence of mycobacteria in 
accordance with claims 19 to 26. 

10 In yet another aspect, the present invention relates to a kit (claim 27 and 28) comprising at 
least one peptide nucleic acid probe as defined in claims 1 to 14. 

BRIEF DESCRIPTION OF THE FIGURES 

15 Alignments of rRNA sequences of M. tuberculosis (as a representative of the MTC group) and 
important closely related species thereto, and M. avium (as a representative of the 
mycobacterial other than those of the MTC group) and important closely related species 
thereto for the 23S, 16S and 5S rRNA have been made (Figures 1A-1K, 2A-2D, 3, 4A-4N and 
5A-B). 

20 

Alignment for the MTC group (23S rRNA) 

Figures 1 A-1 K show alignments of 23S rDNA sequences of M. tuberculosis (GenBank entry 
GB:MTCY130, accession number Z73992), M. avium (GenBank entry GB:MA23SRNA, 
accession number X74494), M. paratuberculosis (GenBank entry GB:MPARRNA, accession 

25 number X74495), M. phiei (GenBank entry GB:MP23SRNA, accession number X74493), M. 
leprae (GenBank entry GB:ML5S23S, accession number X56657), M. gastri (GenBank entry 
GB:MG23SRRNA, accession number Z1721 1), M. kansasii (GenBank entry 
GB:MK23SRRNA, accession number Z17212), and M. smegmatis (GB:MS16S23S5, 
accession number Y08453). Preferred peptide nucleic acid probes should enclose at least one 

30 nucieobase complementary to a nucleobase of M. tuberculosis 23S rRNA within positions 

149-158, 220-221, 328-361, 453-455, 490-501, 637-660, 706-712, 762-789, 989, 1068-1072, 
1148, 1311-1329, 1361-1364, 1418, 1563-1570, 1627-1638, 1675-1677, 1718, 1734-1740, 
1967-1976, 2403-2420, 2457-2488, 2952-2956, 2966-2969, 3000-3003, and 3097-3106 of the 
alignment (indicated by heavy frames). Mismatches between the sequences of M. avium, M. 

35 phlei, M. leprae, M. paratuberculosis, M. gastri and M. kansasii and that of M. tuberculosis in 
the alignment are indicated by light frames. 

Alignment for the MTC group (16S rRNA) 
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Figures 2A-2D show alignments of 16S rDNA sequences of M. tuberculosis (GenBank entry 
GB.MTU16SRN, accession number X5291 7), M. bovis (GenBank entry GB.MSGTGDA, 
accession number M20940), M. avium (GenBank entry GB:MSGRRDA, accession number 
M61673), M. intracellulare (GenBank entry GB:MIN16SRN, accession number X52927), M. 
paratuberculosis (GenBank entry GB:MSGRRDH, accession number M5680), M. 
scrofulaceum (GenBank entry GB:MSC16SRN, accession number X52924), M. leprae 
(GenBank entry GB:MLEP16S1, accession number X55587), M. kansasii (GenBank entry 
GB:MKRRN16, accession number X1 5916), and M. gastri (GenBank entry GB:MGA16SRN, 
accession number X52919). Preferred peptide nucleic acid probes should enclose at least one 
nucleobase complementary to a nucleobase of M. tuberculosis 16S rRNA within positions 76- 
79, 98-101, 135-136, 194-201, 220-229, 242, 474, 1136-1145, 1271-1272, 1287-1292, 1313, 
and 1334 of the alignment (indicated by heavy frames). Mismatches between the sequences 
of M. bovis, M. avium, M. intracellulare, M. paratuberculosis, M. scrofulaceum, M. lepeae, M. 
kansasii, and M. gastri and that of M. tuberculosis in the alignment are indicated by light 
frames. 

Alignment for the MTC group (5S rRNA) 

Figure 3 shows alignments of 5S rDNA sequences of M. tuberculosis (GenBank entry 
GB:MTDNA16S, accession number X75601), M. bovis (GenBank entry GB:MBRRN5S, 
accession number X05526), M. phlei (GenBank entry GB:MP5SRRNA, accession number 
X55259), M. leprae (GenBank entry GB:ML5S23S, accession number X56657), and M. 
smegmatis (GenBank entry GB:MS16S23, accession number Y08453). Preferred peptide 
nucleic acid probes should enclose at least one nucleobase complementary to a nucleobase 
of M. tuberculosis 5S rRNA within position 86-90 of the alignment (indicated by heavy frame). 
Mismatches between the sequences of M. bovis, M. phlei, M. leprae, M. smegmatis and M. 
luteus and that of M. tuberculosis in the alignment are indicated by light frames. 

Alignment for Mycobacteria other than those of the MTC group (23S) 
Figures 4A-4N show alignments of 23S rDNA sequences of M. avium (GenBank entry 
GB:MA23SRNA, accession number X74494), M. paratuberculosis (GenBank entry 
GB:MPARRNA, accession number X74495), M. tuberculosis (GenBank entry GB:MTY130, 
accession number Z73992), M. phlei (GenBank entry GB:MP23SRNA. accession number 
X74493), M. leprae (GenBank entry GB:ML5S23S, accession number X56657), M. gastri 
(GenBank entry GB:MG23SRRNA, accession number Z17211), M. kansasii (GenBank entry 
GB:MK23SRRNA, accession number Z17212), and M. smegmatis (GB:MS16S23S5, 
accession number Y08453). Preferred peptide nucleic acid probes should enclose at least one 
nucleobase complementary to a nucleobase of M. avium 23S rRNA within positions 99-101, 
183, 261-271, 281-284, 290-293, 327-335, 343-357, 400-405, 453-462, 587-599. 637-660, 
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704-712, 763-789, 1060-1074, 1177-1185, 1259-1265, 1311-1327, 1345-1348, 1556-1570, 
1608-1613, 1626-1638. 1651-1659, 1675-1677, 1734-1741, 1847-1853, 1967-1976, 2006- 
2010, 2025-2027, 2131-2232, 2252-2255. 2396-2405, 2416-2420, 2474-2478, 2687. 2719, 
2809, 3062-3068, and 3097-3106 of the alignment (indicated by heavy frames). Mismatches 
5 between the sequences of M. paratuberculosis, M. tuberculosis, M. phlei, M. leprae. M. gastri, 
M. kansasii, M. luteus, and M. smegmatis and that of M. avium in the alignment are indicated 
by light frames. 

Alignment for Mycobacteria other than those of the MTC group (16S) 
10 Figures 5A-5B show alignments of 16S rDNA sequences of M. avium (GenBank entry 
GB:MSGRRDA, accession number X52917), M. intracellular (GenBank entry 
GB:MIN16SRN, accession number X52927),M. paratuberculosis (GenBank entry 
GB:MSGRRDH, accession number M5680), M. scrofulaceum (GenBank entry GB: 
MSC16SRN, accession number X52924), M. tuberculosis (GenBank entry GB:MTU16SRN, 
15 accession number X52917), M. bovis (GenBank entry GB:MSGTGDA, accession number 

M20940), M. leprae (GenBank entry GB:MLEP16S1. accession number X55587), M. kansasii 
(GenBank entry GB:MKRRN16, accession number X159916), and M. gastri (GenBank entry 
GB:MGA16SRN, accession number X52919). Preferred peptide nucleic acid probes should 
enclose at least one nucleobase complementary to a nucleobase of M. avium 16S rRNA 
20 within positions 135-136, 472-475, 1136-1144, 1287-1292, 1313, and 1334 of the alignment 

(indicated by heavy frames). Mismatches between the sequences of M. intracellulare, M. 

paratuberculosis, M. scrofulaceum, M. tuberculosis, M. bovis, M. leprae, M. kansasii, and M. 

gastri and that of M. avium in the alignment are indicated by light frames. 



25 SPECIFIC DESCRIPTION 
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The present invention provides novel probes for use in rapid and sensitive hybridisation based 
assays for the detection of mycobacteria, in particular mycobacteria of the MTC group, and 
one or more mycobacteria other than mycobacteria of the MTC group. 



We have identified suitable variable regions of the target nucleic acid by comparative analysis 
of generally available rRNA sequences. Computers and computer programs, which have been 
used for the purposes disclosed herein, are generally available. From such alignments, 
possibly suitable probes can be identified. The alignments are thus a useful guideline for 
35 designing probes with desired characteristics. The extent and specificity of hybridisation 

between the probe and its target are affected by a number of factors, whereby manipulation of 
one or more of those factors will determine the exact sensitivity and specificity of a probe in 
question. 
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When designing the probes, due regard should be taken to the assay conditions under which 
the probes are to be used. The stringency of the assay conditions determines the degree of 
complementarity needed between the probe and nucleic acid for formation of hybrids. 
Stringency is chosen so as to maximise the difference in stability between the hybrid formed 
with the target nucleic acid and that formed with the non-target nucleic acid. It will typically be 
necessary to choose high stringency conditions for probes which specificity depend on only 
one mismatch to non-target sequences. The more mismatches, the less demand for high 
stringency conditions. 

Furthermore, probes should be positioned so as to minimise the stability of probe:non-target 
nucleic acid hybrids. This may be accomplished by minimising the degree of complementarity 
to non-target nucleic acid, i.e. by designing the probe to span as many destabilising 
mismatches as possible, and to include as many additions/deletions relative to the target 
sequence as possible. Whether a probe is useful to detect a particular mycobacterial species 
depends largely on the thermal stability difference between probe:target hybrids and 
probe: non-target hybrids. For rRNA targets, however, the secondary structure of the region of 
the rRNA molecule in which the target sequence is located may also be of importance. 

Hybrids formed between peptide nucleic acid probes and nucleic acids have a higher thermal 
instability of mismatching bases compared to nucleic acid duplexes of the same sequences. 
Thus, the peptide nucleic acid probes exhibit a greater specificity for a target nucleic acid 
sequence than a traditional nucleic acid probe, which is seen as a greater difference in T m 
values for probe.target hybrids and probe:non-target hybrids. 

The primary concern regarding the length of the peptide nucleic acid probes is the warranted 
specificity, i.e. what length is specific enough for that particular application. The optimal length 
of an oligomer probe comprising a particular site with differences in base composition, e.g. 
among selected regions of mycobacterial rRNA, is a compromise between the general pattern 
that longer probes ensure specificity and shorter probes ensure that the destabilising 
differences in base composition constitute a greater portion of the probe. 

Peptide nucleic acids can form duplexes in either orientation, but the antiparallel orientation 
forms the most regular and stable duplex. Hence the antiparallel configuration is preferred for 
probe applications. Triplex formation with a stoichiometry of two peptide nucleic acid strands 
and one nucleic acid strand may occur if the peptide nucleic acid has a high pyrimidine 
content. Such triplexes are very stable, and probes capable of forming triplexes may thus be 
suitable for certain applications. 
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Mainly because the peptide nucleic acid strand is uncharged, a peptide nucleic acid-nucleic 
acid-duplex will have a higher T m than the corresponding nucleic acid-nucleic acid-duplex. 
Typically there will be an increase in T m of about 1 °C per base pair at 100 mM NaCI 
depending on the sequence (Egholm et al. (1993), Nature, 365, 566-568). 

In contrast to DNA-DNA-duplex formation, no salt is necessary to facilitate and stabilise the 
formation of a peptide nucleic acid-DNA or a peptide nucleic acid-RNA duplex. The T m of the 
peptide nucleic acid-DNA-duplex changes only little with increasing ionic strength. Typically for 
a 15-mer, the T m will drop only 5 °C when the salt concentration is raised from 10 mM NaCI to 
1 M NaCI. At low ionic strength (e.g. 10 mM phosphate buffer with no salt added), 
hybridisation of a peptide nucleic acid to a target sequence is possible under conditions where 
no stable DNA-DNA-duplex formation occurs. Furthermore, target sites that normally are 
inaccessible can be made more readily accessible for hybridisation with peptide nucleic acid 
probes at low salt concentration as the secondary and tertiary structure of nucleic acids are 
melted under such conditions. 

Although it is preferred to use peptide nucleic acid probes targeting specific sequences of 
rRNA, it will readily be understood that peptide nucleic acid probes complementary to the 
rRNA targeting probes will be useful for the detection of the genes (DNA) coding for said 
sequence specific rRNA (rDNA). 

In the broadest aspect, the present invention relates to peptide nucleic acid probes for 
detecting the presence of mycobacteria in a sample (claims 1 to 3). Peptide nucleic acids are 
non-naturally occurring polyamides or polythioamides which can bind to nucleic acids (DNA 
and RNA) with sequence specificity as described e.g. in US 5 539 082. 

In accordance with the present invention, peptide nucleic acid probes of formula (I) or a 
mixture thereof as defined in claims 4 to 8 are provided, which probes are useful for detecting 
mycobacteria of the MTC group (claims 4 to 6) or of one or more mycobacteria other than 
mycobacteria of the MTC group in a sample (claims 7 to 8). The probes are directed to 23S, 
16S or 5S rRNA, with the provisos indicated in claims 4 to 8. 

In the present context and the claims, the term "naturally occurring nucleobases" includes the 
four main DNA bases (i.e. thymine (T), cytosine (C), adenine (A) and guanine (G)) as well as 
other naturally occurring nucleobases (e.g. uracil (U) and hypoxanthine). 

The term "non-naturally occurring nucleobases" comprises i.a. modified naturally occurring 



DK4218SA.001\MHC\ 05-05-97 



9 



10 



nucleobases. Such non-naturally occurring nucleobases may be modified by substitution by 
e.g. one or more C„ alkyl, C M alkenyl or C M alkynyl groups or labels. Examples of non- 
naturally occurring nucleobases are purine, 2,6-diamino purine, 5-propynylcytosine (C 
propynyl), isocytosine (iso-C), 5-methyl-isocytosine (iso Me C) (see e.g. Tetrahedron Letters Vol 
36, No 12, 2033-2036 (1995) or Tetrahedron Letters Vol 36, No 21, 3601-3604 (1995)), 7- 
deazaadenine, 7-deazaguanine, N 4 -ethanocytosine, N 6 -ethano-2,6-diaminopurine, 5-(C^ 6 )- 
alkenyluracil, 5-(C M )-alkynylcytQs!ne, 5-fluorouracil and pseudocytosine. 

Examples of useful intercalators are e.g. acridin, antraquinone, psoralen and pyrene. 

Examples of useful nucleobase-binding groups are e.g. groups containing cyclic or 
heterocyclic rings. Non-limiting examples are 3-nitro pyrrole and 5-nitro indole. 

It is to be understood that alkyl, alkenyl and alkynyl groups may be branched or non-branched, 
15 cyclic or non-cyclic, and may further be interrupted by one or more heteroatoms, or may be 
unsubtituted or substituted by or may contain one or more functional groups. Non-limiting 
examples of such functional groups are acetyl groups, acyl groups, amino groups, carbamide 
groups, carbamoyl groups, carbamyl groups, carbonyl groups, carboxy groups, cyano groups, 
dithio groups, formyl groups, guanidino groups, halogens, hydrazine groups, hydrazo groups, 
20 hydroxamino groups, hydroxy groups, keto groups, mercapto groups, nitro groups, phospho 
groups, phosphono groups, phospho ester groups, sulfo groups, thiocyanato groups, cyclic, 
aromatic and heterocyclic groups. 

d.4 groups contain from 1 to 4 carbon atoms, C M groups contain from 1 to 6 carbon atoms, 
25 and C,. 1S groups contain from 1 to 1 5 carbon atoms, not including optional substituents, 

heteroatoms and/or functional groups. Non-limiting examples of such groups are -CH 3 , -CF 3> 
-CH 2 -, -CH 2 CH 3 , -CH 2 CH 2 -, -CH(CH 3 ) 2 , -OCH 3 , -OCH 2 -, -OCH 2 CH 3 , -OCH 2 CH 2 -, -OCH(CH 3 ) 2 , 
-OC(0)CH 3 , -OC(0)CH 2 -, -C(0)H, -C(O)-, -C(0)CH„ -C(0)OH, -C(0)0-, -CH 2 NH 2 , -CH 2 NH-, 
-CH 2 OCH 3 , -CH 2 OCH 2 -, -CH 2 OC(0)OH, -CH 2 OC(0)0-, -CH 2 C(0)CH 3 , -CH 2 C(0)CH 2 -, 
30 -C(0)NH 2 . -CH=CH 2 , -CH=CH-. -CH=CHCH 2 C(0)OH, -CH=CHCH 2 C(0)0-, -C-CH, -C«C-, 
-CH 2 C S CH, -CH 2 C=C-, -CH 2 C S CCH 3 , -OCH 2 C s CH, -OCH 2 C=C-, -OCH 2 OCCH 3 . 
-NHCH 2 C(0)-, -NHCH 2 CH 2 C(0)-. -NH(CH 2 CH 2 0) 2 CH 2 C(0)-, and HO(0)CCH 2 C(0)(NH- 
(CH 2 CH 2 0) 2 CH 2 C(0)) 2 -. phenyl, benzyl, naphthyl, oxazolyl, pyridinyl, thiadiazolyl. triazolyl, 
and thienyl. 



35 



Within the present context, the expression "naturally occurring amino acid" is intended to 
comprise D- and L-forms of amino acids commonly found in nature, e.g. D- and L-forms of Ala 
(alanine). Arg (arginine), Asn (aspargine), Asp (aspartic acid). Cys (cysteine). Gin (glutamine), 
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Giu (glutamic acid), His (histidine), He (isoleucine), Leu (leucine), Lys (lysine), Met 
(methionine), Phe (phenylalanine), Pro (proline), Ser (serine), Thr (threonine), Trp 
(tryptophan), Tyr (tyrosine) and Val (valine). 

In the present context, the expression "non-naturally occurring amino acid" is intended to 
comprise D- and L-forms of amino acids other than those commonly found in nature as well as 
modified naturally occurring amino acids. Examples of useful non-naturally occurring amino 
acids are D- and L-forms of p-Ala (p-alanine) Cha (cyclohexylalanine), Cit (citrulline), Hci 
(homocitrulline), HomoCys (homocystein), Hse (homoserine), Nle (norleucine), Nva 
(norvaline), Orn (ornithine), Sar (sarcosine) and Thi (thienylalanine). 

The strength of the binding between the probe and the nucleic acid sequence may further be 
influenced by the ligand Q. When Q designates a nucleobase, Hoogsteen and/or Watson- 
Crick base pairing assist(s) in the formation of hybrids between a nucleic acid sequence to be 
detected and the probe. It is contemplated that one or more of the ligands may be a group 
which contribute little or none to the binding of the nucleic acid such as hydrogen. It is 
contemplated that suitable probes to be used comprise less than 25% by weight of peptide 
nucleic acid moieties, wherein Q designates such groups. One or more of the ligands Q may 
be groups that stabilise nucleobase stacking such as intercalators or nucleobase-binding 
groups. 

In the above-indicated probes, one or more of the Q-groups may designate a label. Examples 
of suitable labels are given below. Moieties wherein Q denotes a label may preferably be 
located in one or both of the terminating moieties of the probe. Moieties wherein Q denotes a 
label may, however, also be located internally. 

The peptide nucleic acid probes may comprise moieties, wherein all X groups are O 
(polyamides) or wherein all X groups are S (polythioamides). It is to be understood that the 
probes may also comprise mixed moieties (comprising both amide and thioamide moieties). 

In another aspect, the present invention relates to peptide nucleic acid probes of formula (II). 
(Ill) and (IV) as well as mixtures of such probes according to claim 9. 

in a preferred embodiment, the peptide nucleic acid probes or mixtures thereof according to 
the invention are of formulas (l)-(IV) as defined in claim 10 with Z being NH, NCH 3 or O, each 
R 2 , R 3 and R 4 independently being H or the side chain of a naturally occurring amino acid, the 
side chain of a non-naturally occurring amino acid, or alkyl, and each Q being a naturally 
occurring nucleobase or a non-naturally occurring nucleobase with the provisos defined in 
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claims 4 to 8. 



Peptide nucleic acid probes or mixtures of such probes according to the invention are 
preferably those of formula (l)-(IV) as defined in claim 11 with Z being NH or O, and R 2 being 
H or the side chain of Ala, Asp, Cys, Glu, His, HomoCys, Lys, Orn. Ser or Thr, and Q being a 
nucleobase selected from thymine, adenine, cytosine, guanine, uracil, iso-C, and 2.6- 
diaminopurine with the provisos defined in claims 4 to 8. 

Peptide nucleic acid probes or mixtures thereof, which are of major interest for detecting 
mycobacteria of the MTC group or one or more mycobacteria other than mycobacteria of the 
MTC group, are probes of formula (V) according to claim 12, wherein R 4 is H or the side chain 
of Ala, Asp, Cys, Glu, His, HomoCys, Lys, Orn, Ser or Thr, Q is as defined in claim 11 and 
with the provisos indicated in claims 4 to 8. 

The peptide nucleic acid probe comprises polymerised moieties as defined above and in the 
claims. From the formula, it is to be understood that the probe may comprise polymerised 
moieties which structure may be mutually different or identical. It may be advantageous that at 
least one moiety of the probe, preferably one (or both) of the moieties terminating the probe, 
are of a different structure. Such terminating moieties may suitably be a moiety of formula (VI) 



(VI) 



where Q is as defined above. Such moiety may suitably be connected to a peptide nucleic 
acid moiety though an amide bond. 

The peptide nucleic acid probe according to the invention comprises from 6 to 30 polymerised 
moieties of formulas (I) to (V), and, in addition, optionally one or two terminating moieties of 
formula (VI) as defined above. The preferred length of the probe will depend on the sample 
material and whether labelled probes are used. It is contemplated that especially interesting 
probes comprise from 10 to 30 polymerised moieties of formulas (I) to (V), and. in addition, 
optionally one or two terminating moieties of formula (VI) as defined above. Probes of the 
invention may suitably comprise from 12 to 25 polymerised moieties of formulas (I) to (V), 
more suitably from 14 to 22 polymerised moieties of formulas (I) to (V). most suitably from 15 
to 20 polymerised moieties of formulas (I) to (V), and. in addition, optionally one or two 
terminating moieties of formula (VI). 
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In many cases, it may be advantageous to use a mixture of probes, e.g. probes of different 
length and/or probes directed to different domains in the rRNA. Probes labelled with different 
labels may also be applied, thus allowing differentiation between the probes. Thereby, 
mycobacteria of the MTC group and one or more mycobacteria other than those of the MTC 
group may be detected simultaneously. 

As mentioned above, the polymerised moieties of the probes may be mutually different or 
identical. In some embodiments, the polymerised moieties of formulas (V) constitute at least 
75% by weight (calculated by excluding labels and linkers), preferably at least 80% by weight 
and most preferably at least 90% by weight of the probe. 

The ends on the moieties terminating the probe may be substituted by suitable substituents 
which in the following will be named "linkers". A terminating end may suitably be substituted by 
from 1 to 5 linkers, more suitably from 1 to 3 linkers. Such linkers may suitably be selected 
among C M5 alkyl, C,. w alkenyl and C,.„ alkynyl groups as defined above. The linkers may be 
substituted or unsubstituted, branched or non-branched, or be interrupted by heteroatoms, or 
be substituted or contain functional groups as described above. This may depend on the 
chemical nature of the terminating moiety (i.e. whether the moiety is terminated by a carbon, 
oxygen or nitrogen atom). A terminating end or a linker on a terminating end may further be 
substituted by one or more labels, which labels may be incorporated end to end, i.e. so as to 
form a non-branched labelled end, or may be incorporated so as to form a branched labelled 
end ("zipper")- The linkers may be attached directly to a terminating end, may be attached to a 
label or between labels on a terminating end, or be attached to a terminating end before a 
label is attached to a terminating end. It should be understood that two terminating ends may 
carry different or identical substituents. linkers and/or labels. It should further be understood 
that the term "a label" is intended to comprise one or more labels as the term "linkers" is to 
comprise one or more linkers. For certain applications, it may be advantageous that one or 
more linkers are incorporated between the peptide nucleic acid moieties. Such applications 
may in particular be those based on triplex formation. 



30 



Examples of suitable linkers are -NH(CH 2 CH 2 0) n CH 2 C(0)-, -NH(CHOH) n C(0)-, 
-(0)C(CH 2 OCH 2 ) n C(0)- and -NH(CH 2 ) n C(0)-, NH 2 (CH 2 CH 2 0) n CH 2 C(0)-, NH 2 (CHOH) n C(0)-, 
HO(0)C(CH 2 OCH 2 ) n C(0)-. NH 2 (CH 2 ) n C(0)-, -NH(CH 2 CH 2 0) n CH 2 C(0)OH, 
-NH(CHOH) n C(0)OH, -(0)C(CH 2 OCH 2 ) n C(0)OH and -NH(CH 2 ) n C(0)OH, wherein n is 0 or an 
35 integer from 1 to 8, preferably from 1 to 3. Examples of very interesting linkers are 

-NHCH 2 C(0)-, -NHCH 2 CH 2 C(0)-, -NH(CH 2 CH 2 0) 2 CH 2 C(0)-, HO(0)CCH 2 CH 2 C(0)(NH- 
(CH 2 CH 2 0) 2 CH 2 C(0)) 2 -. 
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In the present context, the term "label" refers to a substituent which is useful for detection or 
visualisation. Suitable labels comprise fluorophores, biotin, dinitro benzoic acid, digoxigenin, 
radioisotope labels, peptide or enzyme labels, chemiluminiscence labels, hapten, antigen or 
antibody labels. 

The expression "peptide label" is intended to mean a label comprising from 1 to 20 naturally 
occurring or non-naturally occurring amino acids, preferably from 1 to 10 naturally occurring or 
non-naturally occurring amino acids, more preferably from 1 to 8 naturally occurring or non- 
naturally occurring amino acids, most preferably from 1 to 4 naturally occurring or non- 
naturally occurring amino acids, linked together end to end in a non-branched or branched 
("zipper") fashion. Such peptide label may be composed of amino acids which are mutually 
identical or different. In a preferred embodiment, such a non-branched or branched end 
comprises one or more, preferably from 1 to 8 labels, more preferably from 1 to 4, further 
labels other than a peptide label. Such further labels may suitably terminate a non-branched 
end or a branched end. One or more linkers may suitably be attached to the terminating end 
before a peptide label and/or a further label is attached. Such linker units may also be 
attached between a peptide label and a further label. Furthermore, such peptide labels may be 
incorporated between the peptide nucleic acid moieties. 

The probe as such may also comprise one or more labels such as from 1 to 8, preferably from 
1 to 4, labels and/or one or more linker units, which may be attached internally, i.e. to the 
backbone of the probe. The linker units and labels may mutually be attached as described 
above. 

Examples of particular interesting labels are biotin, fluorescent labels, such as fluorescein 
labels, e.g. 5-(and 6)-carboxyfluorescein, 5- or 6-carboxyfluorescein, 6-(fluorescein)-5-(and 6)- 
carboxamido hexanoic acid and fluorescein isothiocyanate, peptide labels consisting of from 1 
to 20 naturally occurring amino acids or non-naturally occurring amino acids, peroxidases 
such as horse radish peroxidase (HRP) and soya bean peroxidase, dinitro benzoic acid, 
rhodamine, tetramethylrhodamine, cyanine dyes such as Cy2, Cy3 and Cy5, coumarin, R- 
phycoerythrin (RPE), allophycoerythrin, Texas Red and Princeton Red as well as conjugates 
of R-phycoerythrin and, e.g. Cy5 or Texas Red. 

Examples of preferred labels are biotin, fluorescent labels, peptide labels and dinitro benzoic 
acid. Peptide labels may preferably be composed of from 1 to 10, more preferably of from 1 to 
8, most preferably of from 1 to 4, naturally occurring or non-naturally occurring amino acids. It 
may be particularly advantageous to incorporate one or more other labels as well as a peptide 
label such as from 1 to 8 or from 1 to 4 other labels, preferably one or two other labels. 
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Suitable peptide labels may preferably be composed of cysteine, glycine, lysine or ornithine. 

In a further embodiment, the Q substituent as defined above may be labelled. Suitable labels 
are as defined above. Between Q and such a label, a linker as defined above may be 
incorporated. It is preferred that such labelled ligands Q are selected from thymine and uridine 
labelled in the 5-position and 7-deazaguanine and 7-deazaadenine labelled in the 7-position. 

A mixture of peptide nucleic acids is also part of the present invention. Such mixture may 
comprise more than one probe capable of hybridising to 23S rRNA, more than one probe 
capable of hybridising to 16S rRNA, or more than one probe capable of hybridising to 5S 
rRNA as well as more than one probe capable of hybridising to 23S rRNA, and/or more than 
one probe capable of hybridising to 16S rRNA, and/or more than one probe capable of 
hybridising to 5S rRNA. The mixture may also comprise peptide nucleic acids for detecting 
more than one mycobacteria in the same assay. 

The probes may be synthesised according to the procedures described in "PNA Information 
Package" obtained from Millipore Corporation (Bedford, MA, USA), or may be synthesised on 
an Expedite Nucleic Acid Synthesis System (PerSeptive BioSystems, USA). 

If using the Fmoc strategy for elongation of the probe with linkers or amino acids, it is possible 
to retain side chain amino groups protected with acid sensitive protection groups such as the 
Boc or Mtt group. This method allows introduction of a linker containing several Boc protected 
amino groups which can all be cleaved and labelled in the same synthesis cycle. 

One way of labelling a probe is to use a fluorescent label, such as 5-(and 6)-carboxyfluore- 
scein, 5- or 6-carboxyfluorescein, or 6-(fluorescein)-5-(and 6)-carboxamido hexanoic acid. The 
acid group is activated with HATU (O-lT-azabenzotriazol-l-yO-I.I.S.S-tetramethyluronium 
hexafluorophosphate) or HBTU (2-(1H-benzotriazol-1-yl)-1,1,3,3-tetramethyluronium 
hexafluorophosphate) and reacted with the N-terminal amino group of the peptide nucleic acid. 
The same technique can be applied to other labelling groups containing an acid function. 
Alternatively, the succinimidyl ester of the above-mentioned labels or fluorescein 
isothiocyanate may be used directly. 

After synthesis, probes can be cleaved from the resin using standard procedures as described 
by Millipore Corporation or PerSeptive Biosystems. The probes are subsequently purified and 
analysed using reversed-phase HPLC techniques at 50°C and were characterised by matrix- 
assisted laser desorption/ionisation time of flight mass spectrometry (MALDI-TOFMS), plasma 
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clesorption mass spectrometry (PDMS), electron spray mass spectrometry (ESMS), or fast 
atom bombardment (FAB-MS). 

Generally, probes such as probes comprising polymerised moieties of formula (IV) and (V) 
may also be prepared as described in, e.g.. Angewandte Chemie, International Edition in 
English 35, 1939-1942 (1996) and Bioorganic & Medical Chemistry Letters, Vol 4, No 8, 1077- 
1080 (1994). Chemical properties of some probes are described in, e.g., Nature, 365, 566-568 
(1993). 

Detection of the label depend on the type of label and on the format of the procedure. In cases 
where the sample is deposited onto slides, the hybridisation results may be visualised using 
well known immunohistochemical staining methods for detection of the labelling on the probe. 
When fluorescent labelled probes are used, the hybrids may be detected using an antibody 
against the fluorescent label which antibody may be conjugated with an enzyme. The 
fluorescent label may alternatively be detected directly using a fluorescence microscope, or 
the results may be automatically analysed on a fluorescent-based image analysis system. 

When biotin labelled probes are used, the hybrids may be detected using streptavidin or an 
antibody against the biotin label which antibody or streptavidin may be conjugated with an 
enzyme. If necessary, an enhancement of the signal can be generated using commercially 
available amplification systems such as the catalysed signal amplification system for 
biotinylated probes (e.g. DAKO K 1500). 

The probes according to the invention are used in the detection of mycobacteria, in particular 
mycobacteria of the MTC group or one or more mycobacteria other than mycobacteria of the 
MTC group, in samples, in particular sputum samples, which samples may contain these 
bacteria. 

In the assay method according to the invention, a sample to be analysed for the presence of 
mycobacteria is brought into contact with one or more probes according to the invention under 
conditions by which hybridisation between the probe and any sample rRNA or rDNA 
originating from mycobacteria can occur, and the resulting hybridisation is observed or 
measured. 

If necessary, a mixture of random probes (probes with random, not selected sequences 
optionally of different length) may suitably be added in excess in admixture with the probe to 
reduce non-specific binding. 
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In one embodiment of the assay method, one or more probes according to the invention are 
brought into contact with any target rRNA or rDNA inside the cells (in situ) under suitable 
stringency conditions. Prior to this step, smears of the bacterial cells are prepared using 
conventional procedures. Following hybridisation, the complexes formed are detected. An 
example of this assay format is fluorescence in situ hybridisation (FISH), wherein the probes 
according to the invention are labelled with fluorescein or another fluorophore. It might be 
advantageous to use more than one probe. If e.g. three such probes are included in the assay 
each in a concentration of one third of the concentration of a single probe, possible cross 
reactivity of the individual probes is less likely to invalidate the results. 

The test sample may prior to hybridisation be subjected to conditions which release RNA from 
the organisms present in the sample. Such RNA may in solution be brought into contact with 
one or more probes as defined herein, which optionally are labelled, or the RNA of the test 
sample may be immobilised onto a solid phase prior to hybridisation with one or more 
detection probes. The RNA may be immobilised by dotting the RNA onto membranes or by 
using an immobilised capture probe. 

The sample comprising the target nucleic acid can even be added to an assay system 
comprising detection probes as well as immobilised capture probe. The immobilisation of the 
capture probe may be effected by using a streptavidin coated solid phase and a biotinylated 
capture probe. The capture probe may further be immobilised onto a solid support by coupling 
reaction between a carboxylic acid on the linker and an amino derivatised support. 
Alternatively, the coupling onto the solid support may be accomplished by photochemical 
activation of photoreactive groups which have been attached absorptively to the solid support 
prior to photochemical activation. Such photoreactive groups are described in EP 408 078 A. 

In practice, a solid phase based assay system is very attractive as the analysis can be carried 
out using a solid phase precoated with a capture probe. A solid phase based assay system is 
also feasible for automatisation of the analysis. 

The capture probe may be one of the other probes not used in the hybridisation reaction and 
detection step for target nucleic acid, thus ensuring dual species specificity. The dual 
specificity will allow shorter probes be used, e.g. 10 mer probes. 

Furthermore, the capture of purine rich sequences may be improved by utilising bis-peptide 
nucleic acids as capture probes. Such bis-peptide nucleic acids are described in WO 
96/02558. The bis-peptide nucleic acids comprise a first peptide nucleic acid strand capable of 
hybridising in parallel fashion to the target nucleic acid, and a second peptide nucleic acid 
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strand capable of hybridising in antiparallel fashion to the purine rich sequence of the nucleic 
acid to be captured. The two peptide nucleic acid strands are connected by a linker and are in 
this way capable of forming a triplex structure with said purine rich sequence nucleic acid. 

The number of polymerised moieties of each linker-separated peptide nucleic acid may be as 
previously defined for non-bis-peptide nucleic acids. However, due to the high stability of the 
triplexes formed, bis-peptide nucleic acids with short first and second strands can be used 
which will make the design of a pyrimidine rich probe easier. 

The solid support capture system may take a wide variety of forms well known in the art, such 
as e.g. a plate, a microtiter plate having one or more wells, a microscope slide, a filter, a 
membrane, a tube, a dip stick, a strip, beads such as paramagnetic beads, beads made of 
polystyrene, polypropylene, polyethylene, dextran, nylon, amyloses, natural and modified 
celluloses, polyacrylamides and agaroses. When a filter, a membrane, a strip or beads is (are) 
used as the solid support, it (they) may, if conveniently be incorporated into a single-use 
device. 

It has been observed that peptide nucleic acids may bind to a variety of solid phases. A 
blocking reaction may thus be required to reduce non-specific binding of the peptide nucleic 
acids to the solid phase. The blocking reaction may be carried out with commonly used 
blocking reagents, such as BSA (bovine serum albumin), casein, Triton X-100® orTween 20®. 
The preferred blocking reagents are Triton X-100® and Tween 20®. 



The captured probe:nucleic acid complexes may be detected or identified by a wide variety of 
methods for that purpose. The probe to be brought in contact with the target nucleic acid may 
be labelled, and the label may be detected using well known detection systems. In another 
embodiment, the captures probemucleic acid complexes may be detected using a detection 
system based on an antibody reacting specifically with complexes formed between peptide 
nucleic acid and nucleic acid (such as described in WO 95/17430), in which detection system 
the primary antibody may comprise a label, or which detection system comprises a labelled 
secondary antibody, which specifically binds to the primary antibody. 

The present probes further provide a method of diagnosing infection by mycobacteria and a 
method for determining the stage of the infection and the appropriate treatment by which 
methods one or more optionally labelled probes according to the invention are brought into 
contact with a patient sample and the type of treatment and/or the effect of a treatment is (are) 
evaluated. 
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DESCRIPTION OF SPECIFIC EMBODIMENTS 

Examples of suitable Qs of adjacent moieties are given below. Peptide nucleic acid probes 
comprising such Qs will be suitable for detecting mycobacteria, in particular mycobacteria of 
the MTC group or mycobacteria other than mycobacteria of the MTC group. The probes are 
written from left to right corresponding to from the C-terminal end towards the N-terminal end. 
Suitable Q subsequences for detecting 23S and 16S mycobacterial rRNA of the MTC group 
are disclosed in Danish patent applications DK 1096/96 and DK 1156/96. Other suitable Q 
subsequences for detecting 23S and 16S rRNA as well as 5S rRNA of the MTC group are 
given below. Suitable Q subsequences for detecting 23S and 16S rRNA of mycobacteria other 
than mycobacteria of the MTC group are given below. 

MTC group (23S) 

Suitable Q subsequences directed to 23S rRNA of mycobacteria of the MTC group are given 
below. 

AGATGC GGG TAG CAC (selected from position 149-158 in Figure 1A) 
TGT TTT CTC CTC CTA (selected from position 220-221 in Figure 1 A) 
ACT GCC TCT CAG CCG (selected from position 328-361 in Figure 1 B) 
TGA TAC TAG GCA GGT (selected from position 453-455 in Figure 1 B) 
CGG ATT CAC AGC GGA (selected from position 490-501 in Figure 1C 
TCA CCA CCC TCC TCC (selected from position 637-660 in Figure 1C) 
TTA ACC TTG CGA CAT (selected from position 706-712 in Figure 1D) 
ACT ATT CAC ACG CGC (selected from positions 762-789 in Figure 1D) 
CTC CGC GGT GAA CCA (selected from position 989 in Figure 1D) 
GCT TTA CAC CAC GGC (selected from position 1068-1072 in Figure 1E) 
ACG CTT GGG GGC CCT (selected from position 1148 in Figure 1E) 
CCA CAC CCA CCA CAA (selected from position 1311-1329 in Figure 1F) 
CCG GTG GCTTCG CTG (selected from position 1361-1364 in Figure 1F) 
ACT TGC CTT GTC GCT (selected from position 1418 in Figure 1G) 
GAT TCG TCA CGG GCG (selected from position 1563-1570 in Figure 1G) 
AAC TCC ACA CCC CCG (selected from position 1627-1638 in Figure 1G) 
ACC CCT TCG CTT GAC (selected from position 1675-1677 in Figure 1 H) 
CTT GCC CCA GTG TAA (selected from position 1718 in Figure 1H) 
CTC TCC CTA CCG GCT (selected from position 1734-1740 in Figure 1H) 
GAT ATT CCG GTC CCC (selected from position 1967-1976 in Figure 11) 
ATC CCG CCC CAA CTG (selected from position 2403-2420 in Figure 11) 
CTG TCC CTA AAC CCG (selected from position 2457-2488 in Figure 1 J) 
TTC GAG GTT AGA TGC (selected from position 2457-2488 in Figure 1 J) 
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GGT GCA CCA GAG GTT (selected from position 2952-2956 in Figure 1 J) 
CTG GCG GGA CAA CTG (selected from position 2966-2969 in Figure 1K) 
TTA TCC TGA CCG AAC (selected from position 3000-3003 in Figure 1 K) 
GAC CTATTG AAC CCG (selected from position 3097-3106 in Figure 1K) 

MTC group (16S) 

GAA GAG ACC TTT CCG (selected from position 76-79 in Figure 2A) 
CAC TCG AGT ATC TCC (selected from position 98-101 in Figure 2A) 
ATC ACC CAC GTG TTA (selected from position 136-136 in Figure 2A) 
GCA TCC CGT GGT CCT (selected from position 194-201 in Figure 2B) 
GCA TCC CGT GGT CCT (selected from position 194-201 in Figure 2B) 
TAA AGC GCT TTC CAC (selected from position 222-229 in Figure 2B) 
GCT CAT CCC ACA CCG (selected from position 242 in Figure 2B) 
CCG AGA GAA CCC GGA (selected from position 474 in Figure 2C) 
AGT CCC CAC CAT TAC (selected from position 1 136-1 145 in Figure 2C) 
AAC CTC GCG GCA TCG (selected from position 1271-1272 in Figure 2C) 
GGC TTT TAA GGA TTC (selected from position 1287-1292 in Figure 2D) 
GAC CCC GAT CCG AAC (selected from position 1313 in Figure 2D) 
CCG ACT TCA CGG GGT (selected from position 1334 in Figure 2D) 

MTC group (5S) 

Suitable Q subsequences directed to 5S rRNA of mycobacteria of the MTC group are given 
below. 

CGG AGG GGC AGT ATC (selected from position 86-90 in Figure 3) 
Mycobacteria other than those of the MTC group (23S) 

Suitable Q subsequences directed to 23S rRNA of mycobacteria other than those of the Ml 
group are given below. 

GAT CAATGC TCG GTT (selected from position 99-101 in Figure 4A) 

TTC CCC GCG TTA CCT (selected from position 183 in Figure 4A) 

TTA GCC TGT TCC GGT (selected from position 261-271 in Figure 4B) 

GCA TGC GGT TTA GCC (selected from position 281-284 in Figure 4B) 

TAC CCG GTT GTC CAT (selected from position 290-293 in Figure 4B) 

GTA GAG CTG AGA CAT (selected from position 327-335 and 343-357 in Figure 4C) 

GCC GTC CCA GGC CAC (selected from position 400-405 in Figure 4C) 

CTC GGG TGT TGA TAT (selected from position 453-462 in Figure 4D) 

ACT ATT TCA CTC CCT (selected from position 587-599 in Figure 4D) 
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ACG CCA TCA CCC CAC (selected from position 637-660 in Figure 4E) 
CGA CGT GTC CCT GAC (selected from position 704-712 in Figure 4E) 
ACT ACA CCC CAA AGG (selected from position 763-789 in Figure 4F) 
CAC GCTTTT ACA CCA (selected from position 1060-1074 in Figure 4F) 
5 GCG ACT ACA CAT CCT (selected from position 1 177-1 1 85 in Figure 4F) 
CGG CGC ATA ATC ACT (selected from position 1259-1265 in Figure 4G) 
CCA CAT CCA CCG TAA (selected from position 131 1-1327 in Figure 4G) 
CGC TGA ATG GGG GAC (selected from position 1345-1348 in Figure 4G) 
GGA GCT TCG CTG AAT (selected from position 1361-1364 in Figure 4H) 
10 CGG TCA CCC GGA GCT (selected from position 1361-1364 in Figure 4H) 
GGA CGC CCATAC ACG (selected from position 1556-1570 in Figure 4H) 
GAA GGG GAATGG TCG (selected from position 1608-1613 in Figure 41) 
AAT CGC CAC GCC CCC (selected from position 1626-1638 in Figure 41) 
CAG CGA AGG TCC CAC (selected from position 1651-1659 in Figure 41) 
15 GTC ACC CCA TTG CTT (selected from position 1675-1677 in Figure 41) 
ATC GCT CTC TAC GGG (selected from position 1734-1741 in Figure 41) 
GTG tat GTG CTC GCT (selected from position 1847-1853 in Figure 4J) 
ACG GTA TTC CGG GCC (selected from position 1967-1976 in Figure 4 J) 
GGC CGA ATC CCG CTC (selected from position 2006-2010 in Figure 4 J) 
20 AAA CAG TCG CTA CCC (selected from position 2025-2027 in Figure 4J) 
CCT TAC GGG TTA ACG (selected from position 2131-2132 in Figure 4K) 
GAG ACA GTT GGG AAG (selected from position 2252-2255 in Figure 4K) 
TGG CGT CTG TGC TTC (selected from position 2396-2405 in Figure 4 L) 
CGA CTC CAC ACA AAC (selected from position 2416-2420 in Figure 4L) 
25 GAT AAG GGT TCG ACG (selected from position 2474-2478 in Figure 4L) 
ATC CGT TGA GTG ACA (selected from position 2687 in Figure 4M) 
CAG CCC GTT ATC CCC (selected from position 2719 in Figure 4M) 
AAC CTT TGG GAC CTG (selected from position 2809 in Figure 4M) 
TAA AAG GGT GAG AAA (selected from position 3062-3068 in Figure 4N) 
30 GTC TGG CCT ATC AAT (selected from position 3097-31 06 in Figure 4N) 

Mycobacteria other than those of the MTC group (16S) 

Suitable Q subsequences directed to 16S rRNA of mycobacteria other than those of the MTC 
group are given below. 
35 AGA TTG CCC ACG TGT (selected from position 1 35-1 36 in Figure 5A) 
AAT CCG AGA AAA CCC (selected from position 472-475 in Figure 5A) 
GCA TTA CCC GCT GGC (selected from position 1 136-1 144 in Figure 5A) 
TTA AAA GGA TTC GCT (selected from position 1287-1292 in Figure 5B) 
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AGA CCC CAA TCC GAA (selected from position 1313 in Figure 5B) 
GAC TCC GAC TTC ATG (selected from position 1334 in Figure 5B) 

The invention is further illustrated by the non-limiting examples given below. 
EXAMPLES 
EXAMPLE 1 

1 o In situ hybridisation to fixed bacterial cells 

To test the ability of the peptide nucleic acid probes to detect MTC and not mycobacteria other 
than MTC, in particular not mycobacteria of the M. avium Complex, or Neisseria gonorrhoeae, 
fluorescence in situ hybridisation (FISH) was performed on fixed bacterial cells using 
fluorescein labelled probes as shown below. It was shown that these probes did not hybnd.se 

15 to M. avium, M. intracellular, or N. gonorrhoeae. 

Preparation of bacterial slides 

M bovis BCG (Statens Seruminstitut, Denmark. Catalogue number 2645), M. avium (Statens 
Seruminstitut. Denmark. Laboratory number 3716 (E37978)). and M. intracellulars (Statens 
20 Seruminstitut. Laboratory number 3717 (E39562)) were grown in Dubos medium (Statens 

Seruminstitut. Denmark) or on Lowenstein-Jensen medium (Statens Seruminstitut. Denmark) 
at 37 °C. N. gonorrhoeae was grown on chocolate agar at 37 °C with additional 5% C0 2 . 

Bacterial smears were prepared on test slides according to standard procedures. The smears 

25 were air-dried followed by flame fixation. 

FISH on bacterial slides 

The following procedure was performed. 

1 . The slide is immersed in 80% ethanol for 1 5 minutes, subsequently rinsed with water and 
30 air-dried. 

2. The bacterial slide is covered with a hybridisation solution containing the probe in 
question. 

3 The slide is incubated in a humid incubation chamber at 45°C or 55°C for 90 minutes. 

4. The slide is washed 25 minutes in TBS-buffer. pH 10 at 45»C or 55°C. followed by 30 

35 seconds in water. 

5. The slide is dried and mounted (DAKO Fluorescence Mounting Medium or equivalent). 

The following hybridisation solutions was used: 
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Hybridisation 10 mM NaCI 

solution 10% Dextran sulphate 

30% formamide 

0.1% Triton X-100® 

50 mM Tris-HCI, pH 7.6 

50 mM EDTA 

0.1% sodium pyrophosphate 
0.2% polyvinylpyrrolidone 
0.2% Ficol 

TBS buffer 10 mM sodium phosphate, pH 10 

145 mM NaCI 

All solutions are made RNase free following standard procedures. 
The following peptide nucleic acid probes were used 



Lys(Flu)-Lys(Flu)-TTC GAG GTT AGA TGC-NH 2 
Lys(Flu)-Lys(Flu)-ACT CCA CAC CCC CGA-NH 2 
Lys(Fiu)-Lys(Flu)-TCA CCA CCC TCC TCC-NH 2 
Lys(Flu)-Lys(Flu)-AAC TCC ACA CCC CCG-NH 2 
Lys(Flu)-Lys(F!u)-TCA CCA CCC TCC TCC-NH 2 
Lys(Flu)-Lys(Flu)-TTC GAG GTT AGA TGC-NH 2 
Lys(Flu)-Lys(Flu)-CAC AAG ACA TGC ATC-NH 2 



OK 306 
OK 309 
OK 446 
OK 449 
OK 447 
OK 306 
OK 310 



wherein Flu denotes a fluorescein isothiocyanate label or a 5-(and 6)-carboxyfluorescein label, 
and Lys(Flu)-Lys(Flu) denotes a peptide label ("zipper") with two Flu labels attached. All the 
probes are directed to 23S rRNA of the mycobacteria of the MTC group, except OK 310 which 
is directed to 16S rRNA. The results are shown in Table 1. 
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TABLE 1 





45°C 


OK 309 (250nM) 
45°C 


OK 446 (500nM) 
55°C 


OK 449 (SOOnM) 
55°C 


M. bovis BCG 


positive 


positive 


positive 


positive 


M. avium 


negative 


negative 


negative 


negative 


M. intracellulare 


negative 


negative 


not determined 


not determined 


N. gonorrhoeae 


negative 


negative 


not determined 


not determined 





OK447(1^M) 
55°C 


OK 310 (250nM) 
45°C 


OK 306 (500nM) 
OK 310 (SOOnM) 
55°C 


M. bovis BCG 


positive 


positive 


positive 


M. avium 


negative 


negative 


negative 


M. intracellulare 


not determined 


negative 


negative 


N. gonorrhoeae 


not determined 


negative 


not determined 



EXAMPLE 2 



Test in dot blots 

To further test the ability of the peptide nucleic acid probes to detect MTC and not MAC or E. 
coli, dot blot tests were carried out. 

M. bovis BCG (Statens Seruminstitut Catalogue number 2645) and M. intracellulare (Statens 
Seruminstitut, Denmark Laboratory number 3713 (E39562)) were grown in Dubos medium 
(Statens Seruminstitut, Denmark) or on LOwenstein-Jensen medium (Statens Seruminstitut, 
Denmark) at 37 °C. 

i RNA was isolated from the bacterial cells by use of TRI-reagent (Sigma) following 

manufacture's directions. E. coli rRNA was purchased from Boehringer Mannheim. Germany. 

The following peptide nucleic acid probes were used. 

D Lys(Flu)-Lys(Flu)-CTG TCC CTA AAC CCG-NH 2 OK 305 

Lys(Flu)-Lys(Flu)-GTC CCT AAA CCC GAT-NH 2 u * 

Lys(Flu)-Lys(Flu)-ACT CCA CAC CCC CGA-NH 2 OK 309 

Lvs(Flu)-Lys(Flu)-Gly-GCA TCC CGT GGT CCT-NH 2 OK 223 

Lys(Flu)-Lys(Flu)-CAC AGG ACA TGC ATC-NH 2 uiv J iu 
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wherein Flu denotes a fluorescein isothiocyanate label or a 5-(and 6)-carboxyfluorescein label, 
and Lys(Flu)-Lys(Flu) denotes a peptide label ("zipper) consisting of 2 amino acids, 
respectively, with two Flu labels attached. OK 305, OK 307 and OK 309 are directed to 23S 
rRNA of the mycobacteria of the MTC group. OK 223 and OK 310 are directed to 16S rRNA of 
5 the mycobacteria of the MTC group. 



Preparation of dot blots 

The following buffers were used: 



10 20 x SSPE buffer 3 M NaCI 

0.2 M P0 4 3 " 
0.02 M EDTA 
pH 7.4 



15 TST buffer 0.05 M Tris/HCI 

0.5 M NaCI 
0.5% Tween 20® 
pH 9.0 

20 200 ng M. bovis RNA, M. intracellulare RNA and E. coli rRNA were dotted onto membranes 
(Schleich & Schuel, NY 13 N), and the membranes were dried and fixed under UV light for 2 
minutes. Each of the probes (70 nM probe in hybridisation solution (hybridisation solution 
without Triton X-100® and with the exception that formamide was substituted with 50% 
glycerol)) were added to the membrane. Hybridisation was continued for 1 .5 hours at 55 °C. 

25 The membranes were rinsed 2 times for 15 minutes in 2 x SSPE buffer containing 0.1% SDS 
at ambient temperature, and subsequently 2 times for 15 minutes in 0.1 x SSPE buffer 
containing 0.1% SDS at 55 °C or at 65 °C (see Table 2). The membrane was blocked with 
0.5% casein dissolved in 0.5M NaCI, 0.05M Tris/HCI pH 9.0. Thereafter, the membranes were 
incubated for 1 hour with rabbit-anti FITC antibody labelled with AP (DAKO K0046 vial A) 

30 diluted 1 :2000 in 0.5% casein dissolved in 0.5M NaCI, 0.05M Tris/HCI pH 9.0. After incubation, 
the membranes were washed 3 times 5 minutes with TST at ambient temperature. Bound 
probes were visualised following standard procedures using BCIP/NBT, and the visualisation 
was stopped by incubation for 10 minutes with 10 mM EDTA. The blot was dried at 50 °C. 

35 The results are given in Table 2 below. 



DK4218SA.001\MHC\ 05-05-97 



25 



TABLE 2 





E. coli rRNA 


M. bovis BCG RNA 


M. intracellular 
RNA 


Probe 


55 °C 


65 °C 


55 °C 


65 °C 


55 °C 


65 °C 


OK 305 


negative 


negative 


positive 


positive 


negative 


weak 


OK 307 


negative 


negative 


positive 


positive 


negative 


weak 


OK 309 


negative 


negative 


positive 


positive 


negative 


weak 


OK 223 


negative 


negative 


positive 


positive 


nd 


nd 


OK 310 


negative 


negative 


negative 


positive 


negative 


negative 



nd: Not determined 



EXAMPLE 3 

5 

Test of probes on clinical smears of sputum 

The ability of the peptide nucleic acid to bind to mycobacteria of the MTC group was tested on 
clinical smears of sputum prepared by the Division of Microbiology at Ramathibodi Hospital, 
Bangkok, Thailand. Smears from the same patient were initially evaluated positive by Ziehl- 
10 Neelsen staining. Ziehl-Neelsen staining only shows the presence of acid fast bacilli, not 
whether these are mycobacteria of the MTC group. 

The following probes were used. 

1 5 Lys(Flu)-Lys(Flu)-TCA CCA CCC TCC TCC-NH 2 
Lys(Flu)-Lys(Flu)-AAC TCC ACA CCC CCG-NH 2 
Lys(Flu)-Lys(Flu)-TTC GAG GTT AGA TGC-NH 2 
Lys(Flu)-Lys(Flu)-CAC AAG ACA TGC ATC-NH 2 

20 wherein Flu denotes a fluorescein isothiocyanate label or a 5-(and 6)-carboxyfluorescein label, 
and Lys(Flu)-Lys(Flu) denotes a peptide label ("zipper") consisting of 2 amino acids with two 
Flu labels attached. OK 446, OK 449 and OK 306 are directed to 23S rRNA of MTC, whereas 
OK 310 is directed towards 16S rRNA of MTC. Furthermore, a random peptide nucleic acid 
probe (a 15-mer wherein each position may be A, T, C or G obtained from Millipore 

25 Corporation (Bedford, MA, USA)) was used in order to increase the signal-to-noise ratio. 

The clinical smears were prepared according to the procedure described in Example 1. The 
probe concentrations were varied as indicated below in the Table 3. The results are shown in 
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Table 3. 



TABLE 3 



Sample 


OK 446 (1nM) 


OK 449 (1>iM) 


Ziehl-Neelsen 


number 


Random (50jaM) 


Random (50[iM) 


staining 


285 


Positive 


Positive 


4+ 


335 


Positive 


Eq. 


2+ 


345 


Positive 


Positive 


3+ 


224 


Positive 


Positive 


3+ 


297 


Negative 


Eq. 


2+ 


179 


Negative 


Negative 


4+ 


247 


Negative 


Negative 


2+ 


255 


Positive 


Positive 


2+ 


202 


Eq. 


Positive 


2+ 








Sample 


OK 306 (500nM) 


Ziehi-Neelsen 




number 


OK310(500nM) 


staining 




213 


Positive 


4+ 




292 


Positive 


4+ 




159 


Positive 


3+ 




287 


Positive 


3+ 





10 



to the following method: -: 0 baccilli. +/-: 1-200 per 300 fields. 2+: 1-9 per 10 fields, 3+: 1-9 per field, 4 + : 
>9 per field. 

Smears using FISH were examined with a 100x objective and scored according to the following 
method: Positive: Several mycobacteria were identified in the smear. Negative: No fluorescent 
mycobacteria were identified in the smear. Eq: Few (1-3) fluorescent mycobacteria were identified in 
the smear. 



15 



It appears from the table that some of the Ziehl-Neelsen positive smears are MTC-negative. 
The results thus indicate that the peptide nucleic acid probes can separate MTC-positive and 
MTC-negative samples. 



DK4218SA.001\MHC\ 05-05-97 



27 



CLAIMS 



1 Peptide nucleic acid probe for detecting the presence of mycobacteria in a sample, said 
probe being capable of hybridising to target sequences in rRNA or DNA from the area coding 
for said rRNA of the mycobacteria forming detectable target-probe hybrids, 

and a mixture of such probes. 

2 Peptide nucleic acid probe according to claim 1, said probe being capable of hybridising to 
23S, 16S or 5S rRNA or DNA from the area coding for said rRNA of the mycobacteria formmg 
detectable target-probe hybrids, 

and a mixture of such probes. 

3 Peptide nucleic acid probe according to claim 1 or 2 for detecting the presence of 
mycobacteria of the Mycobacterium tuberculosis Complex (MTC), in particular M. 
tuberculosis, or one or more mycobacteria other than mycobacteria of the Mycobacterium 
tuberculosis Complex (MTC), in particular mycobacteria of the Mycobacterium av.um 
Complex in a sample, which probe comprises from 6 to 30 polymerised peptide nucle,c acd 
moieties, said probe being capable of hybridising to target sequences in 23S, 16S or 53 rRNA 
or DNA from the area coding for said rRNA of the mycobacteria to be detected, and a m.xture 
of such probes. 

4 Peptide nucleic acid probe according to any one of claims 1 to 3 for detecting mycobacteria 
of the Mycobacterium tuberculosis Complex (MTC) in a sample, which probe comprises from 
10 to 30 polymerised moieties of formula (I) 




(I) 



wherein each X and Y independently designate O or S, 

each Z independently designates O, S, NR\ or C(R 1 ), wherein each R 1 independently 
designate H, C M alkyl, alkenyl. C v6 alkynyl. 

each R 2 R 3 and R 4 designate independently H, the side chain of a naturally occurring amino 
acid the side chain of a non-natura.ly occurring amino acid, C M alkyl, C M alkenyl or C M 
alkynyl or a functional group, each Q independently designates a naturally occurring 
nucleobase, a non-naturally occurring nucleobase. an intercalator, a nucleobase-bindmg 
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group, a label or H, 

with the proviso that the Qs of adjacent moieties are selected so as to form a sequence of 
which a subsequence includes at least one nucleobase complementary to a nucleobase of 
tuberculosis 23S rRNA that differs from the corresponding nucleobase of M. avium located 
within the following domains 



Positions 149-158 in Figure 1A, 
Positions 220-221 in Figure 1 A, 
10 Positions 328-361 in Figure 1 B, 
Positions 453-455 in Figure 1B, 
Positions 490-501 in Figure 1C, 
Positions 637-660 in Figure 1C, 
Positions 706-712 in Figure 1D t 
1 5 Positions 762-789 in Figure 1 D, 
Position 989 in Figure 1D, 
Positions 1068-1072 in Figure 1E, 
Position 1148 in Figure 1E, 
Positions 1311-1329 in Figure 1F, 
20 Positions 1 361 -1 364 in Figure 1 F, 
Position 1418 in Figure 1G, 
Positions 1563-1570 in Figure 1G, 
Positions 1627-1638 in Figure 1G, 
Positions 1675-1677 in Figure 1H, 
25 Position 1718 in Figure 1H, 

Positions 1734-1740 in Figure 1H, 
Positions 1967-1976 in Figure 11, 
Positions 2403-2420 in Figure 11, 
Positions 2457-2488 in Figure 1 J, 
30 Positions 2952-2956 in Figure 1 J, 
Positions 2966-2969 in Figure 1K, 
Positions 3000-3003 in Figure 1K or 
Positions 3097-3106 in Figure 1K, 



35 



and further with the proviso that the probe comprising such subsequence is able to form 
hybrids with target sequences in 23S rRNA of said mycobacteria, 
and a mixture of such probes. 
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5. Peptide nucleic acid probe according to any one of claims 1 to 3 for detecting mycobacteria 
of the Mycobacterium tuberculosis Complex (MTC) in a sample, which probe comprises from 
10 to 30 polymerised moieties of formula (I) as defined in claim 4, 

with the proviso that the Qs of adjacent moieties are selected so as to form a sequence of 
which a subsequence includes at least one nucleobase complementary to a nucleobase of M. 
tuberculosis 16S rRNA that differs from the corresponding nucleobase of M. avium located 
within the following domains 

Positions 76-79 in Figure 2A, 
Positions 98-101 in Figure 2A, 
Positions 135-136 in Figure 2 A, 
Positions 194-201 in Figure 2B, 
Positions 222-229 in Figure 2B, 
Position 242 in Figure 2B, 
Position 474 in Figure 2C, 
Positions 1136-1145 in Figure 2C, 
Positions 1271-1272 in Figure 2C, 
Positions 1287-1292 in Figure 2D, 
Position 1313 in Figure 2D, or 
Position 1334 in Figure 2D, 

and further with the proviso that the probe comprising such subsequence is able to form 
hybrids with target sequences in 16S rRNA of said mycobacteria, 
and a mixture of such probes. 

6. Peptide nucleic acid probe according to any one of claims 1 to 3 for detecting mycobacteria 
of the Mycobacterium tuberculosis Complex (MTC) in a sample, which probe comprises from 
10 to 30 polymerised moieties of formula (I) as defined in claim 4, 

with the proviso that the Qs of adjacent moieties are selected so as to form a sequence of 
which a subsequence includes at least one nucleobase complementary to a nucleobase of M. 
tuberculosis 5S rRNA that differs from the corresponding nucleobase of M. avium located 
within the following domain 

Positions 86-90 in Figure 3 

and further with the proviso that the probe comprising such subsequence is able to form 
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hybrids with target sequences in 5S rRNA of said mycobacteria, 
and a mixture of such probes. 

7. Peptide nucleic acid probe according to any one of claims 1 to 3 for detecting one or more 
mycobacteria other than mycobacteria of the Mycobacterium tuberculosis Complex in a 
sample, which probe comprises from 10 to 30 polymerised moieties of formula (I) as defined in 

claim 4, 

with the proviso that the Qs of adjacent moieties are selected so as to form a sequence of 
which a subsequence includes at least one nucleobase complementary to a nucleobase of M. 
avium 23S rRNA that differs from the corresponding nucleobase of M. tuberculosis located 
within the following domains 



Positions 99-101 in Figure 4A, 
15 Position 183 in Figure 4A, 

Positions 261-271 in Figure 4B, 
Positions 281-284 in Figure 4B, 
Positions 290-293 in Figure 4B, 
Positions 327-335 in Figure 4C, 
20 Positions 343-357 in Figure 4C, 
Positions 400-405 in Figure 4C, 
Positions 453-462 in Figure 4D, 
Positions 587-599 in Figure 4D, 
Positions 637-660 in Figure 4E, 
25 Positions 704-712 in Figure 4E, 
Positions 763-789 in Figure 4F, 
Positions 1060-1074 in Figure 4F, 
Positions 1177-1185 in Figure 4F, 
Positions 1259-1265 in Figure 4G, 
30 Positions 1311-1327 in Figure 4G, 
Positions 1345-1348 in Figure 4G t 
Positions 1361-1364 in Figure 4H, 
Positions 1556-1570 in Figure 4H, 
Positions 1608-1613 in Figure 4l t 
35 Positions 1626-1638 in Figure 41, 
Positions 1651-1659 in Figure 41, 
Positions 1675-1677 in Figure 41, 
Positions 1734-1741 in Figure 41, 
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Positions 1847- 


-1853 


in 


Figure 


4J, 


Positions 1967- 


•1976 


in 


Figure 


4J, 


Positions 2006- 


•2010 


in 


Figure 


4J, 


Positions 2025- 


-2027 


in 


Figure 


4J, 


5 Positions 2131- 


-2132 


in 


Figure 


4K, 


Positions 2252- 


-2255 


in 


Figure 


4K, 


Positions 2396- 


-2405 


in 


Figure 


4L, 


Positions2416- 


-2420 


in 


Figure 


4L, 


Positions 2474- 


■2478 


in 


Figure 


4L, 



10 Position 2687 in Figure 4M, 
Position 2719 in Figure 4M, 
Position 2809 in Figure 4M, 
Positions 3062-2068 in Figure 4N, or 
Positions 3097-3106 in Figure 4N, 



15 

and further with the proviso that the probe comprising such subsequence is able to form 
hybrids with target sequences in 23S rRNA of said mycobacteria, 
and a mixture of such probes. 

20 8. Peptide nucleic acid probe according to any one of claims 1 to 3 for detecting one or more 
mycobacteria other than mycobacteria of the Mycobacterium tuberculosis Complex in a 
sample, which probe comprises from 10 to 30 polymerised moieties of formula (I) as defined in 
claim 4, 

25 with the proviso that the Qs of adjacent moieties are selected so as to form a sequence of 

which a subsequence includes at least one nucleobase complementary to a nucleobase of M. 
avium 16S rRNA that differs from the corresponding nucleobase of M. tuberculosis located 
within the following domains 

30 Positions 135-136 in Figure 5A, 

Positions 472-475 in Figure 5A t 

Positions 1136-1144 in Figure 5A, 

Positions 1287-1292 in Figure 5B, 

Position 1313 in Figure 5B, or 
35 Position 1334 in Figure 5B, 

and further with the proviso that the probe comprising such subsequence is able to form 
hybrids with target sequences in 16S rRNA of said mycobacteria, 
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and a mixture of such probes. 

9. Peptide nucleic acid probe according to any one of claims 1 to 8 of formula (II), (HI), or (IV) 

Q 




R 4 

wherein Z, R 2 , R 3 , and R 4 , and Q is as defined in claim 4, and a mixture of such probes. 

10. Peptide nucleic acid probe according to any one of claims 1 to 9, wherein Z is NH, NCH 3 
or O, each R 2 , R 3 and R 4 independently designate H or the side chain of a naturally occurring 
amino acid, the side chain of a non-naturally occurring amino acid, or C,_ A alkyl, and each Q is 
a naturally occurring nucleobase or a non-naturally occurring nucleobase with the provisos 
defined in claims 4 to 8, and a mixture of such probes. 

11. Peptide nucleic acid probe according to any one of claims 1 to 10, wherein Z is NH or O, 
and R 2 is H or the side chain of Ala, Asp, Cys, Glu, His, HomoCys, Lys, Orn, Ser or Thr, and 
Q is a nucleobase selected from thymine, adenine, cytosine, guanine, uracil, iso-C and 2,6- 
diaminopurine with the provisos defined in claims 4 to 8, and a mixture of such probes. 

12. Peptide nucleic acid probe according to any one of claims 1 to 11 of formula (V) 

Q 
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wherein R 4 is H or the side chain of Ala, Asp, Cys, Glu, His, HomoCys, Lys, Orn, Ser or Thr, 
and Q is as defined in claim 1 1 with the provisos defined in claims 4 to 8, and a mixture of 
such probes. 

13. Peptide nucleic acid probe according to any one of claims 1 to 12 further comprising one 
or more labels and a mixture of such probes according to any one of claims 1 to 12 further 
comprising one or more labels which may be mutually identical or different, which probes 
optionally may comprise one or more linkers which may be mutually identical or different with 
the provisos defined in claims 4 to 8. 

14. Peptide nucleic acid probes according to any one of claims 1 to 6 and 8 to 13, wherein the 
Qs of adjacent moieties are selected so as to form the following subsequences 



TTC GAG GTT AGA TGC, 



(OK 306) 



ACT CCA CAC CCC CGA, ( OK 309 > 

TCA CCA CCC TCC TCC, (OK 446) 

AAC TCC ACA CCC CCG, ( OK 449 > 

TCA CCA CCC TCC TCC, (OK 447) 



TTC GAG GTT AGA TGC, 



(OK 306) 



CAC AAG ACA TGC ATC, ( OK 31 0) 

CTG TCC CTA AAC CCG, ( OK 305) 

GTC CCT AAA CCC GAT, or ( OK 307 > 

GCA TCC CGT GGT CCT, ( OK 223 > 

and mixtures of such probes. 

15. A mixture of peptide nucleic acid probes according to claims 1 to 14. 

16. Use of a peptide nucleic acid probe or a mixture thereof according to any one of claims 1 
to 15 for the detection of the presence of mycobacteria in a sample. 

1 7. Use of a peptide nucleic acid probe or a mixture thereof according to any one of claims 1 
to 6 and 9 to 15 for the detection of the presence of mycobacteria of the Mycobacterium 
tuberculosis Complex (MTC), in particular M. tuberculosis. 

18. Use of a peptide nucleic acid probe or a mixture thereof according to any one of claims 1 
to 3, 7 to 1 3 and 1 5 for the detection of the presence of one or more mycobacteria other than 
mycobacteria of the Mycobacterium tuberculosis Complex, in particular mycobacteria of the 
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Mycobacterium avium Complex. 

19. Method for detecting mycobacteria in a sample comprising 

(1) contacting any rRNA or rDNA optionally present in said sample with one or more 
peptide nucleic acid probes according to anyone of claims 1 to 15 under conditions, 
whereby hybrids between said probe(s) and said rRNA are formed, and 

(2) observing or measuring said hybridisation, and relating said observation or 
measurement to the presence of mycobacteria in said sample. 

20. Method according to claim 19 for detecting mycobacteria of the Mycobacterium 
tuberculosis Complex (MTC), in particular M. tuberculosis. 

21 . Method according to claim 1 9 for detecting one or more mycobacteria other than 
mycobacteria of the Mycobacterium tuberculosis Complex. 

22. Method according to any one of claims 19 to 21, wherein the contact between any rRNA 
optionally present in the sample and one or more peptide nucleic acid probes according to 
anyone of claims 1 to 14 or mixtures thereof according to claim 15 takes place inside the cells 
(in situ). 

23. Method according to any one of claims 19 to 21, 

characterised in that the hybridisation is performed in solution and the hybrids are 
captured on a solid phase before measuring the extent of hybridisation. 

24. Method according to claim 23, 

characterised in that a peptide nucleic acid probe according to any one of claims 1 to 
15 are used for capturing the hybrids. 

25. Method according to any one of claims 19 to 21, 

characterised in that RNA of the test sample is immobilised onto a solid phase prior to 
performing step (1). 

26. A method according to any one of claims 19 to 25, 

characterised in that a signal amplifying system is used for measuring the resulting 
hybridisation. 
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27. Kit for detecting mycobacteria, in particular mycobacteria of the Mycobacterium 
tuberculosis Complex (MTC), in particular M. tuberculosis, and/or one or more mycobacteria 
other than mycobacteria of the Mycobacterium tuberculosis Complex, in particular 
mycobacteria of the Mycobacterium avium Complex, 

5 characterised in that said kit comprises at least one peptide nucleic acid probe 
according to any one of claims 1 to 14, and optionally a detection system with at least one 
detecting reagent. 

28. Kit according to claim 27, 

10 characterised in that it further comprises a solid phase capture system. 
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ABSTRACT 

NOVEL PROBES FOR THE DETECTION OF MYCOBACTERIA 

Novel hybridisation assay probes and mixtures of such probes for detecting the presence of 
rRNA originating from mycobacterial species. The probes may suitable be directed to 23S, 
16S or 5S rRNA of said mycobacteria. Such probes are capable of detecting the organisms in 
test samples, e.g. expectorates, sputum, aspirates, urine, blood and tissue sections, food, soil 
and water. 
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130 140 150 160 

109 3 GGGG* AA CCCAGCACGAGTGA-G TCGTg H^ CCC gCAI^v M . tuberculosis 
a 2 2 GGGG§AA CCCAC-CACGAGTGATC-TCGTGTTACCCGTATCT M . av i urn 
122 GGGGb'^aCCCac-CACGAGTGATC-TCGTGTTACCCGTATCT M . paratucerc . 
= 07 GGGG^AACCCGGCACGAGTGATGTCGTGTCACCCAACGCT M . phlei 

1 * 2 GGGGW CCCAACACGAGTCACGTCC-TG^ACCCGTATCT M . 1 epr ae 

2 07 GGGGAJLACCCAGCACGAGTAA-C-CGTGTTACCCGTATCT M . gastr 1 
150 GGGGAAACCCAGCACGAGTGAT 37 Z 3TGTTACCCGCATCT 1*1 . Kansas- 

2588 GGGGAAACCCX:-3CACGAGTGA-3TC3TGTCACCAGGCGCT M . s~eg~atis 



• • • 



. r— i i 

210 220 230 240 

1172 CATCTCAGTACCCG-AC-GAiaAC-AAAACAATTGTGAl"rc-(!; M . tuberculosis 

501 CATCTCAGTACCCGTAGGAGA^X-AAAACAATTGTGATTCC M . avium 

501 C^TCTCAGTACCCGTAGGAGAAGAAAACAATTGTGATTCC M.paratuberc . 

586 -CATCTCAGTACCCGTAG^.GAAGAAAACAATTGTGATTCC M . phlei 

511 CATCTCAGTACCCGTAGGA.GAAGAAAACAATTGTGATTCC M . leprae 

286 CATCTCAGTACCCGTAGGAGAAGAAAACAAASgTGATTCC M . gastri 

229 CATCTCAGTACCCGTAGGAG^>.GAAAACAA^A|GTGATTCC M . kansasn 

2 66 7 CATCTCAGTQCCCGTAGGAgGAGAAAACAAgTGTGATTCC M . smeginatis 



Figure 1A 
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33 0 34 0 35 0 360 

1239 -aTaaa Ai5-GA-ATGTCTCAG r£r-ACCCGGCTGAGA-GG M . tuberculosis 
617 TGTGGGA^CO -ATGTCTCAGCT:CTACCT;GGCTGAG:3 - C-G M . avium 

TGTGGGATTbATATGTCTCAC-CTCTACCJC-GCTGAGG-GG M . carstuberc . 



617 
703 
629 




4 04 TGTGGGATCGATACC-TCTCAGCTCTACCCGGCTGAdd-GG M . gastri 
347 TGT'GGGATCGA7ACG7CTCAGC7;C7ACCCGGCTGAGG-GG M . kansasii 

2 7 8 5 TGTGGGAtCTA-CT-TC-CGCCTCTACCI-C-GCTG'^GAgGG M . srne grr. a t i s 



37 0 3 SO 3 90 400 



132 7 3^.GTCAGAAAC-T3TC3TGGT7AGCGGAAGTGGCCTGGGAT 'A . tuberculosis 

6 5 6 TAGTCAGAAA6T3TC 3TGGT7AG CGGAAGTGGCCTGGGAQ M . a v i un 

6 56 TAGTCAGAAAG 73TC GTGGTT.-.GCGGAAGTGGCCTGGGAC M - paratuberc . 

74 2 TAGT|AgAAAGCAG7 GTGGTTAGGTGAAGTGGCCTGGGAT M . ph lei 

6 68 jrAGTCAGAAAGTG^CGTGGTT-AGCGGAAATGGCCTGGGAT M . leprae 

4 43 CAGTCAGAAAGTGTCGTGGTTAACGGAAGTGGCCTGGGAT M . gastri 

3 86 CAGTCAGAAAGTGTCGTGGTTA^CGGAAGTGGCCTGGGAT M .kansasii 

2 823 CAGTCAGAJLAA73T73TGGT7AGCGGAJL3TGGCTTGGGAT M . smeg-.at is 



450 460 470 480 

1406 CGGCACCTGCC7lG~ATCAATTCCCGAGTAGCAGCGGGCC M . tuberculos l s 

735 CGGCACCTGCCTTATATCAACSCCCGAGTAGCAGCGGGCC M . a v ium 

735 CGGaACCTGCCTT ATATCAJIC ACCCC-AGTAGCAC-CGC-GCC M . paratuberc . 

820 5GgTCCq^GC?G TCACAGG-'-[ TCCCGAGTAGCAGCGGGCC M . phlei 

747 yGGCACCTGCC7^G7ATCAATTCCCGAGTAGCAGCGGGCC M . 1 epr ae 

522 CGGCA.CCTGCC7*TG7A.TCAATTCCCGAGTAGCAGCGGGCC M . gastri 

4 65 CGGCACCTGCCT7-G7ATCAATTCCCGAGTAGCAGCGGGCC M . kansasii 



2 902 CGgOgljCTG^CTTG ATGGTG TTCCCGAGTAGCAGCGGC-CC M . smegma t i s 
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490 500 510 520 

1AA6 or.Tr-na&Tr fec-GTC-^ M . tuberculosis 

7 7 5 CGTGGAATCTGCTGTGAATCfC-CCGGGACCACCCGGTAAG M .avium 

7 7 5 CGTGGAATGTC-CTGTGAATCTGCCGGGACCACCCGGTAAG- M . paratuberc . 

8 57 CGTGGAATGTGCTGTGAATCTGCCGGGACCACCCGGTAAG M . phlei 
7 8 7 CGTGGAATdTGCTGTGAATC^GCCGGGACCACCCGGTAAG M . leprae 
5 6 2 CGTGGAATCTC-CTGTC-AATCTC-CCGGGACCACCCGGTAAG M . gastri 

5 0 5 CGTGGAATCTGCTGTC-AATCTG-CCGGGACCACCCGGTAAG M.kansasil 

2 94 2 CGTGGAATC^GCTGTGAATqTGCCGGGACCACCCGGTAAG M . smegmat i s 



e • o 



610 

! 



15 6 6 GTACCTGAAACCGTG 

8 94 GTACCTGAAACCGTG': 
894 GTACCTGAAACCGTG 
97 6 GTACCTGAAACCGTG 

9 07 GTACCTGAAACCGTG 
682 GTACCTGAAACCGTG 
625 GTACCTGAAACCGTG 



620 630 640 

:GCCTACAATCCGTCAGAGCCECTI M. tuberculosis 
TGCCTACAATCCGTCAGAGCCTCCT M . avium 
TGCCTACAATCCGTCAGAGCCTCCT M . paratuberc . 
TGCCTACAATCCGTCASaGCCCTCT M . ph 1 e i 
TGCCTACAATCCGTCAGAGCCTCCT M. leprae 
iGCCTACAATCCGTCAGAGCCjCjCjTT M . gastri 
TGCCTACAATCCGTCAGAGCCCTTT M . kansas i i 



3 062 GTACCTGAAACCGTGCGCTTACAATCCGTCAGAGCqC5Cg M . smegma t i s 



650 



670 680 

1606 TTTCCTCTCCGGAGC-AGGGTGGTGATGGCGTGCCTTTTGA M . tuberculosis 



660 



934 
934 
1016 
947 



GTGSGGTGATGGCGTGCCTTTTGA M . avium 
! GTGyGGTGATGGCGTGCCTTTTGA M . paratuberc . 



jT^ GTAGTQ3GGTGATGGCGTGCCTTTTGA M . phlei 

tTT --tGTGGGGTGATGGCGTGqiiTTTTGA M . leprae 

722 Tj ! g[tGGGGTGATGGCGTGCCTTTTGA M. gastri 

66 5 £ jGpjGj^GGTGATGGCGTGCCTTTTGA M.kansasii 

3102 gCGTG ?, ' GTGiqGGTGATC-GCGTGCCTTTTGA M.smegmatis 
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690 700 710 720 

16 4 6 a c-^TGAGCCTGCC-AGTCAGC-GAC .^STCk^-C-GT-^ M . tuberculosis 

95 9 AGAATGA GCCTC-CGAGTCAGGGACAC;GTCGCC-AGGTTAAC M . avium 

9 5 9 ^GAATGAGCCTGCGAGTCA.GGGACACGTCGCjGAGGTTAAC M . paratuberc . 

10*6 AC-a ATGAC-CCTGCGAC-TCAGC-GACATGTCGCGAGGTTAAC M . phlei 

9 7 2 AGAATGAGCCTGCGAGTCAGGGACATGTCGCGAGGTTA^C M . 1 epr ae 

7A 7 AGAATG^GCCTC-CC-AGTCAGC-GACATGTCGCpAGGTTAAC M. gastn 

6 9 0 AGAATGAGCCTC-CC-AGTCAC-GGACATGTCC-CpAGGTTAAC M . kansasii 

3132 AGAATGAGCCTGCGAGT'CAGGGACATGTCGCjGAGGTTAAC M. smegmatis 



e • • 
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770 780 790 800 

1726 C GACCCACACGCC-CATA CGCGCGTGTGAAt rAC-TGGCGTGT M . tuberculosis 

103 9 CGCATCCCCTT7GGG ;gtGTAGTGGCGTGT M. avium 

1039 Cg'cATC'CCTTTTGGG jGTGTAGTGGCGTGT M . paratuberc . 

1126 CGTATjc^AC£TGTT GgGpgTG^TAGTGGgGTGT M . phi e i 

1052 CGTATCACG^GTC-AGCGT jGTGTAGTGGCGTGT M. leprae 

827 CGTATiCAC^CGTAAGCGT- - -jGTGTAGTGGCGTGT M . gas t r i 

7 70 CG?AT^qGCC-CGAGCGT ;gTGTAGTGGCGTC-T M . kansas i i 



3212 CGTAT[C CACAC.AAG A C-TGTGTC- j GTQTAGTGGTjGTGT M . smegmatis 



970 980 990 1000 

i i ! L. 

1926 ATTTAGGTGCAGCGTTGCGTGGTTCACdStGGAGGTAGAG M . tuberculosis 

1228 ATTTAGGTGCAGCGTTGCGTGGTTCACG&CGGAGGTAGAG M . avium 

122 8 ATTTAGGTGCAGCGTTGCGTGGTTCACCjACGGAGGTAGAG M . paratuberc . 

1322 ATTTAGGTGCAC-CGTgGCATG^TT CTTA' ICGGAGGTAGAG M . phlei 

1244 ATTTAGGTGCAC-CGTTGCGTC-GTTCACCjACGGAGGTAGAG M. leprae 

1019 ATTTAGGTGCAGCGTTGCGTG^TTCACCKCGGAGGTAGAG M.gastri 

962 ATTTAGGTGCAC-CGTTGCGTGjl|TTCACqA|cGGAGGTAGAG M . kansasii 

3408 ATTTAGGTGCAGCGTCGCATG$TT CTTGCj CGGAGGTAGAG M. smegmatis 
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1050 1060 1070 1080 

2005 CAGCCAAACTCCGAATGCCG - TC-GTG -JTA^A^SCG'i'GGL'A M . tuberculosis 
1307 CAGCCAAACTCCGAATGCCG - TGG7G - TAAAAGCGTGGCA M . avium 

13 07 CAGCCAAACTCCGAATGCCG - T GGTG - TAAAAGCGTGG CA M . paratuberc . 

14 01 CAGCCAAACTCCC-AATGCCC-ATAAG- - TGAAAG^GTC-GCA M . phlei 
1323 CAGCCAAACTCCGAATGCCG - TGGTT - TAAAAGCGTGGCA M .leprae 
1098 CAGCCAAACTCCGAATGCCG - TGGTG - TATjAj-jGCGTGGCA M . castri 
1041 CAGCCAAACTCCGAATGCCG - TGGTG - TATA-!gCGTGGCA M . kansasii 

34 86 CAGCCAAACTCCGAATGCCGpT^GgCAAGAGgGgGGgA M . smegir.atis 



• • • 



1 1 i ■ 

1130 1140 1150 1160 

2 082 ACAGCCCAGATCGCCGGCTAAGGCCCCC^AGCGTGTGCTA M . tuberculos l s 

13 85 ACAGCCCAGATCGCCGGCTAAGGCCCCTAAGCGTGTGCTA M. avium 
1385 ACAGCCCAGATCGCCC-GCTAAGGCCCCt|aAGCGTGTGCTA M . paratuberc . 
1479 ACAGCCCAGATCGCCGGCTAAGGCCCCTjAAGCGTGTGCTA M . ph 1 e i 

14 01 ACAGCCCAGATCGCCGGCTAAGGCCCC^AAGCGTGTGCTA M. leprae 
1175 ACAGCCCAGATCGCCGGCTAAGGCCCCAAAGCGTGTGCTA M.gastri 
1118 ACAGCCCAGATCGCCGGCTAAGGCCCCA^AGCGTGTGCTA M. kansasii 

3566 ACAGCCCAGATCGCCGGTjTAAGGCCCCgAAGCGTgTG^TA M.smegmatis 
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T 



1290 1300 1310 1320 



2241 CTCAAGCACACC GCCGAAGCCGCGG^ 

. . r T ^r<7\rpn r r r rO T TI>! Wi avium 



IS** CTCAAGCACACCGCCG.^GCCGCGC-CA^CA^CTT ri T^ M . paratuberc . 
1638 CTCAAGCACACCGCCGAAGCCGCGGCAj^TjCAgqgi^Q M.phlei 
15 6 0 CTCAAGCACACCGCCGAAGCCGCGGCACAT^rCACCTT^A M . leprae 
133^ CTCAAGCACACCGCCGAAGCCGCGACA^->COC-C--A M. gastri _ _ 
12 77 CTCAAGCACACCC-CCGAAGCCC-CC-ACA^-^ACqGC- -A M . kansas 1 l 

3 7 2 6 TTCAAGCACACCC-CCC-AAGCCGCGGAA^CCA>:cgT2TG M . smegma t i s 



1330 1340 1350 1360 

2 2 8 0 ^^^3^G-GC-TAGGGGAGCGTCCCTcAtTCAGCGAAG M. tuberculosis 
1583 BGGTGGStGTGGC-TAGGGGAC-CGTCCCCCATTCAGCGAAG M. avium 

15 83 'ggGTGG^TGTGGGTAGGGGAGCGTCCC^CATTCAGCGAAG M . paratuberc . 
1676 WGGCTGGTGTGGGTAGGGGAGCGTCqrGCATgC§GTGAAG M . phlei 

16 00 Ic^GTGGSTGTGGGTAGGGGAGCGTTCCTCATTCAGCGAAG M. leprae 
13 67 a'gGTV^- ~ ITGGGTAGGGGAGCGTCCCTCATTCAGCGAAG M . gastri 
1310 AGGT 1 !TGGGTAGGGGAGCGTCCCTCATTCAGCGAAG M. kansasil 



3764 TT--- 



STGGGTAGGGGAGCGTCqTG-AljcC^GSGAAG M . smegmatis 



-i 1 r 



-i r 



1370 1380 1390 1400 

2319 ICT^D CGGGTG ACCGGTGGTGGAGGGTGGGGGAGTGAGAAT M . tuberculosis 
1623 CT^CCGGGTGACCGGTGGTGGAGGGTGGGGGAGTGAGAAT M . avium 
1623 qr JcCGGGTGATCGGTGGTGGAGGGTGGGGGAGTGAGAAT M . paratuberc . 
1716 C^dcCGSGTGATCGGTGGTGGAGGGTGgGGGAGTGAGAAT M.phlei 
164 0 CClicCGGGTAACCGGTGGTGGAGGGTGGGGgAGTGAGAAT M. leprae 
14 02 CCycCGGGTGACCGGTGGTGGAGGATGGGGGAGTGAGAAT M . gastri 
1345 qrqCCGGGTGACCGGTGGTGGAGGATGGGGGAGTGAGAAT M . kansasil 

3796 CCGCCC-SGTAT63AGTGGTGGAGGGTG3GGGAGTGAGAAT M.smegmatis 
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1410 1420 14 3 0 144 0 

2359 GC*GGCATGAGTAGCG^GGCAA GTGAGAACCTTGCeC M . tuberculosis 
IS 62 GC^C»T(»GTAGCGA^^-CAAGTCaC^CCTTCCCC M . avium 
1662 GC^ GGCATGAGTAGCGATAAGGCAAGTGAGAACCTTGCCC M . paratuberc . 
1756 GCAGGCATGAGTAGCGATA-AGGCAAGTGAGAACCTTgCCC M.phlei 
1680 GCAGGCATGAGTAGCGATAAGGCAAGTGAGAACCTTGCCC M . leprae 
1AA 2 GCAGGCATGAGTAGCGATAAGGCAAGTGAGAACCTTGCCC M.gastri^ 
1385 GCAGGCATGAGTAGCGATAAGGCAAGTGAGAACCTTGCCC M.kansasii 

3836 GCAGGCATGAGTAGCGATTAGC-CAAGTGAGAACCTTSCCC M . smegmatis 



e o • 
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1570 1580 1590 16 0 ° 

2519 rr. r^C:GTGAO C-^TCA- C-CGG TACTAACCACCCAAAACCG M . tuberculosis 
1821 CGTCCQTGATGAATCA-GCGGTACTAACCACCCAAAACCG M. avium ^ 
T821 CGTCCGTGA'TG.-ATCA-GCGGTACTAACCACCCAAAACCG M . paratuoerc . 
1915 CGMccldTGATGA^T CTCATTC TC-CTAACCACCCAAAACCg M.phlei 
18*0 CGCCCGTGATGAATCA-GCGGTACTgACCACCCAAAACCG M . leprae 
1602 CGCCCGTGATC^ATCA-GCGGTACTAACCACCCAAAACCG M.gastri _ _ 
1545 CGCCCGTC-ATC^ATCA-GCGGTACTAACCACCCAAAACCG M . kansasil 

3 996 CGSCCgTGAgGAATCA-GCGGTACTAACCAgCCAAAACC^ M . smegmatis 

• 1 1 — i 1 r 

1610 1620 1'63 0 164 0 

2558 c-AT- CGATCAC-TCCCCTTCC-GG C-G-pTaGAG^EpG M . tuberculosis 

I860 GAT - CGA^CAT - TCCCCTTCGGGGGC -jGTGGSGjATQ - p|GG M . avium 

I860 GAT - CGA'dcAT - TCCCCTTCGGGGG;C - GTGGCGA^ljlj-jGGG M. paratuberc . 

1955 GG^-CGATC^-ATCCQTTCGGGGg^^ M.phlei 

1879 GAT - CGA0CATATCCCCTTCGGGGGGT§TGGAGGTj^ - gGG M. leprae 

1641 GAT - CGATCAO- TCCCCTTCGGGGGA"^jGTGGAGGTC - TGG M.gastri 

1584 GAT-CGATCAC-TCCCCTTCGGGGGcJgTGGAG^TC-TGG M.kansasii 

4 03 5 'ACC^GAgCgC^CC^jTTCGGGGy-TGTGGgGTT^TGG M . smegmatis 



Figure 1G 



8/32 



— j ] i r 

1650 1660 1670 1680 

2594 GGCTGCGTGGGA^XTirGCTGGTAGTAGTCAAG dBSa SGG M . tuberculosa s 
1896 GGCTGCGTGGGACCTTCGCTGGTAGTAGTCAAGC^ATGGG M . avium 

18 9 6 GGCTGCGTGGGACCTTCGCTGGTAGTAGTCAAGc{?jA|l}GGG M . paratuberc . 

19 8 6 GGCTGCGTGGGACC^G-GTGGGTAGTAGTCAAGCGATGGG M . ph 1 e i 
1917 GGCTGCGTGGGAACTTCGTTGGTAGTAGTCAJlGCGATGGG M . leprae 
16 77 GGCTGCGTGGAGCCTTCGCTGGTAGTAGTCAAGCGAjljGGG M.gastri 
16 2 0 GGCTGCGTGGjAGCCTTCGCTGGTAGTAGTCAAGCGA|l|GGG M . kansasi i 

4 0 71 GGCTGCATGGGACCTTCGTjTGGTAGTAGTCAAGCGATGGG M . smegma t i s 

j - , j r 

1690 1700 1710 1720 

2634 -GTGACGCAGGAiGGTAGCCGTACCAGTCAGTGGTAAO^- M . tuberculosis 

193 6 - GTGACGCAGGAAGGCAGCCGTACCAGTCAGTGGTAAjTA- M . avium 

19 3 6 -GTGACGCAGGAAGGCAGCCGTACCAGTCAGTGGTAJ^TA- M . paratuberc . 

2 02 5 - GTGACGCAGGAAGG7AGCCGTACCAGTCAGTGGTAATA- M . ph 1 e i 

1957 - GTGACGCAGGA2LGGTAGCCGTACCAGTCAGTGGTAATA- M. leprae 

1717 -GTGACGCAGGAAGGCAGCCGTACCAGTCAGTGGTAATA- M.gastri 

166 0 - GTGACGCAGGA^.GGCAGCCGTACCAGTCAGTGGTAATA- M.kansasii 

4111 - GTGACGCAGGAAGGTAGCCGTACOgGTCAGTGGTAATA- M . smegmat i s 

1 1 1 r 

1730 1740 1750 1760 

1 _i_ i i_ 

2672 - CTGGGGCAAGC OiG i'AGGGR GAGCGATAGGCAAATCCGT M . tuberculosis 

19 74 - CTGGGGCAAGCCCGTAGj- -jAGAGCGATAGGCAAATCCGT M . avium 

19 74 - CTGGGGCAAGCCjCGTAG—AGAGCGATAGGCAAATCCGT M . paratuberc . 

2 063 - CgGGGGJAAACCTGTAGGGgGAG^GATAGGCAAATCCGT M . phlei 

1995 -CTGG§GCAAGCCc|g7AGGGAGAGCGATAGGCAAATCCGT M. leprae 

1755 - CTGGGGCAAGCCAGTAGGGAGAGCGATAGGCAAATCCGT M.gastri 

1698 -CTGGGGCAAGCC^GTAGGGAGAGCGATAGGCAAATCCGT M.kansasii 

4149 -CgGG|QG3AAGCCgGTAGGGAGiTCAGATAGGTAAATCCGT M.smegmatis 
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1970 1980 1990 2000 



■ i l 

2908 AGGGGC ^CCGGAATATC GTGAACACCCTTGCGGTGGGAGC M . tuberculosis 



22 08 AGGGGGCCCGGAAT 
2 2 0 8 AGGGGg'cC CGGAAT 



ACCGTGAACACCCTTGCGGTGGGAGC M. avium 
ACCGTGAACACCCTTGCGGTGGGAGC M . paratuberc 
AC CGTGAGGG CT CTTG CGGCGGC AG C M . phlei 



2 298 AGGGGGAC C CACG T. _ _ _ 

2231 AGGGGGgCCGGAATATCGTGAACACCCTTGCGGTGGGAGC M. leprae 
1910 ' ' M • g^stri 

1934 AGGGGGACCGGPATACCGTGA2LCACCCTTGCGGTGGGAGC M . kansasii 

4 3 85 AGGGGGACCCACATCGCGTGTAAGCCTTTACG G^CA AGC M.smegmatis 



, 1 1 r 

2410 2420 2430 2440 

3345 AC fcTCGACGCCAG^-GGSSn SGAGTCGTTGTTGAAATACC M . tuberculosis 

2 64 5 GGA^GACGCCAGTTTGljGTGGAGTCGTTGTTGAAATACC M . avium 

2 64 5 bJciACAGACGCCAGTTT M . paratuberc . 

2 73 7 y GTcdGACGCCAGTTCGGGJIjGGAGTCGTTGTTGAAATACC M . phi e i 

2 66 8 ACgTCGACGGJ:AGTTGGGGTGGAGTCGTTGTTGAAATACC M. leprae 

1910 *" M.gastri 

23 72 ACCTCAlACGCCAGTTGGGGTjGGAGTCGTTGTTGAAATACC M. kansasii 

4 822 BqrCACjACGCCAGTjG'IiGGGiljGGAGTCGTTGTTGAAATACC M * smegma t i s 



Figure 11 
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2450 2460 2470 24 8 0 

3385 Ll CT--Ti~r"T T - r ^^. ^ ^"nrit,AAiL M . tuberculosis 

2685 £CTCTGATCGTATTGGACAC^CTAACpTCGAACCCTj-TATC M . avium 
2 S S5 ACTCTG ATCGTATTGGa'cAC CTAAQGjTCGAAC C CtJ-TATC M . paratuberc . 
2777 ACTCTGATCGTATTGGH-CGTCTAACCTCC^ACCGTGGATC M.phlei 
2708 ^CTCTGATOGTATTG^CATCTA^-CCTCGAACCgTATATC M. leprae 
LJ — M.gastri 

2412 ACTCTGATCGTATTGCSCAgCTAACpTCGAACCCTGAATC M.kansasii 

4862 ACTCTGATCGTATTGGGCgTCTA^CCTCC^ACCGTAtATC M. smegmatis 

, 1 1 r 



24S0 2500 2510 2520 

3 a 2 5 C-C-Gl'l^GpC - a CAG ru-CC rGC-CC-C-GTAGTTTAACTGC-C-C-C M . tuberculosis 
272* GGGTTCACGGACAC-TGCCTGGCC-GGTAGTTTAACTGGGGC M. avium 

272* GGGTTC^OGGACAGTGCCTGGCGGGTAGTTTAACTGGGGC M . paratucerc . 
2817 CGGTTCAGGGACAGTC-CCTGGTC-GGTAGTTTAACTGGGGC M.phlei 
2 7A8 teGTTT^GGGACAGTGCCTGGCGGGTAGTTTAACTGGGGC M . leprae 
<-j M . gastn 

2452 GGGTTRAgGGACAGTGCCTGGCGGGTAGTTTAACTGGGGC M.kansasii 

4 9 02 'CGGTTSAGGGACAGTGCCTGGJGGGTAGTTTAACTGGGGC M. smegmatis 
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2930 2940 2950 2960 

i i i i_ 

3 864 AGTACGAGAGGACCGGGACGGACGAACCTCTES^^CCA M. tuberculosis 

3163 AGTACGAGAGGACCGGGACGGACGAACCTCTGGTATACCA M. avium 
3163 AGTACGAGAGGACCGGGACGGACGAACCTCTGGTATACCA M . paratube r c . 
3 256 AGTACGAGAGGACCGGGACGGACGAACCTCTGGTATjACCA M.phlei 
3187 AGTACGAGAGGACCGGGACGGACGAACCTCTGGTATjACCA M . leprae 
1910 M.gastri 
2 891 AGTACGAGAGGACCGGGACGGACGAACCTCTg.GTGCACCA M . kansas i i 

5342 AGTACGAGAGGACCGGGA.CGGACGAACCTCTGGTjkTACCA M . smegmatis 
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2970 29S0 2SS0 3000 

290 a gttgtE^C^ C-GC-GCACCC-CTC-C-ATAC-CCACGTTCGC^I: M ■ tuberculosis 
3 2 03 GTTGTCCCACCAGGGGCACC-GCTGGATAGCCACGTTCGGA M . avium 
3203 GTTGTCCCa'cCAGGGGCACGGCTGGATAC-CCACGTTCGG^ M . paratuoerc . 
3 296 GTTGTCCCXCCAC-GGGCACCGCTGGATAGCCACGTTCGGA M . phlei 
3 227 GTTGTCTCACCAGGGGCACCGCTGGATAC-CCACGTTCGG^ M . leprae 
1910 1-1 1-1 M.gastri 

2 931 GTTGTCCCACCAGGGC-CACCGCTGGATAGCTACGTTCGG^ M . kansasii 

53 82 GTTGTCCCACCAC-C-GGCACGGCTC-GATAGCCACGTTCGGAI M. smegmat is 



t r 



3010 3020 3030 3040 

3 94 a T^GATAJICC GC-GAAAGCATCTAAGCGGGAAACCTTCTC M. tuberculosis 
32*3 CAGGATA ACCGC~GAAA.GCATCTAAGCGGGA-AA.CCTTCi'C M . avium 
3 243 CAGGATAACCGCTGAAAGCATCTAAGCGGGAAACCTTCTC M . paratuberc . 
3336 CAGGATAJICCGCTGAAAGCATCTAAGCGGGAAACCTC^TC M . phlei 
3 267 CASGATAACCGCTGAAAGCATCTAAGCGGGAAACCTTCTC M . leprae 
1910 ~" * M.gastri _ 

2 971 CAGGATAACCGCTG£AAGOATCTAAGCGGCA?^CCTTCTC M . Jcansa s 1 1 

5422 CAGGATAACCGCTGAAAGCATCTAAGCGGGAAACCTjC^TC M . smegmat is 



1 

3090 
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3100 3110 3120 

4023 CCCGC-AGAACA CGGC- ESgiM.&^AGA CCl'GGAAGCT M . tuberculosis 
3322 CCCGC-AGACCACGGGSttGATAGGCCAGACCTGGAAGCT M. avium 
3322 CCCGC-AGATCACGGGA ! T|rGATAGGdcAGACCTGGAAGCT M . paratuberc . 
3415 CCCGC-AGAUcACGGGATCC,ATAGAdcAGACCTGGA|GC§ M. phlei 
33 09 " - M. leprae 

1910 _ M.gastri^ 

3 050 CCCGC-AGAACACGGGTTQC-ATAGG.GCAGACCTGGAAGCT M . Kansasil 



5501 CCCGC-AGAeCACGGGATTdATAGAgCAGACCTGGAAGCg M. smegmat is 
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50 60 70 80 

GCGGCGTGCTTAACACATGCAAGTCGAACGGAAAG£SXC2)C M. tuberculosis 
GCGGCGTGCTTAJICACATGC JLAGTCGAACGGAAAGGTCTC M . bovis 
GCGGCGlSCTTAACACATGCAAGTCGAACGGAAAG^CTC M . avium 

| ' ^TTAACACATGCAJlGTgGAACGGAAAG^qcgC M. intracellulars 

GCGGCGTGCTTAJ^CACATGCAJlGTCGAACGGAAAG^CjCTC M . paratuberc . 
GCGGCGTGCTTAJ^AJjVTO^ M . scrof ulaceum 

GCGGCGTGCTTAA.CA.CATGCAA.GTCGAACGGAAAGGTCTC M . leprae 
pCGGCGTGCTTAACACATGCA-AGTCGA-ACGGAA-AGGTCTC M . kansasii 
GCGGCGTGCTT:AACACATGCAAGTCGAACGGAAAGGTCTC M . gastri 



1 1 r- 

90 100 110 120 

T tcC- GEGATh CTCGAGTGGCGAACGGGT M.tuoerculosis 

T TCGGAG A7ACTCG AGTGGCG AACGGGT M . bov i s 

T TCGGAQSTACTCGAGTGGCGAACGGGT M. avium 

T tcGGGg'-TACTCGAGTGGCGAACGGGT M. intracellulare 

T TCGGAQG7ACTCGAGTGGCGAA.CGGGT M . paratuberc . 

T tCGGQg!g7A.C7CGAGTGGCG AACGGGT M. scrof ulaceum 

T AAAAAATCTTTT TIA.GAGATACTCGAGTGGCGAACGGGT M. leprae 

T TCGGAGACACTCGAGTGGCGAACGGGT M . kansasii 

T TCGGAGACACTCGAGTGGCGAACGGGT M . gastri 

, 1 " i r 

130 140 150 160 



GAGTAACACGTGGGE^TCTGCCCTGCACTTC-GGGATAA M. tuberculosis 
GAGTAACACGTGGGTGATCTGCCCTGCACTTC-GGGATAA M . bovis 
GAGTAACACGTGGGCAATCTGCCCTGCACTTC - GGGATAA M. avium 
GAGTAACACGTGGGCaIatCTGCCCTGCACTTC-GGGATAA M. intracellulare 
GAGTAACACGTGGGCaIaTCT^CCCTGCACTTC - GGGATAA M . paratuberc . 
GAGTAACACGTGGGjCAATCTGCCCTGCACTTC - GGGATAA M . scrof ulaceum 
GAGTAACACGTGGGTAATCTGCCCTGCACTTCSgGGATAA M. leprae 
GAGTAACACGTGGGjC^TCTGCCCTGCACAQC - GGGATAA M . kansasii 
GAGTAACACGTGGGgAATCTGCCCTGCAGAqC - GGGATAA M . gastri 



Figure 2A 
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170 180 ISO 200 

109 GCCTGGC-AAAC TC-C-GTCTAATACCQC-ATAC-C-ACCACGG^ M ■ tuberculosis 

24 8 GCCTGGC- AAACTC-GGTCTAATACCGGATAGGACCACGGGA M . bovi S 

lis GCCTGGGAAACTGGGTCTAATACCGGATAGGACC^qAAGA M . avium 

9 8 GCCTGGGAAACTC-GGTCTAATACCGGATAGGACCTTTAGg M . intrace 1 lulare 
1A 6 GCCTGGGAAJICTGGGTCTAATACCGGATAGGACCgGAAlGA M . paratuberc . 

1 0 9 GCCTGGGAAACTC-GGTCTAATACCQGATAGGACCAOTTG^ M . scrof ul aceum 
16 0 GC^TGGGAAACTGGGTCTAATACCC^-ATAGGA(^CAAGg M . leprae 

108 GCCTGGGAAACTGGGTCTAATACCGGATAGGACCAqrTGd M . kansasil 

109 GCCTGGGaAACTC-GGTCTAATACCGGATAGGACCACJ^GG M . gastn 



210 220 230 240 

149 ; ^ c »_ T ^TTr.ra^ M ■ tuberculosis 

2 8 8 TGCATGTCTTGTGGTGGAAAGCGCTTTAGCGGTGTGGGAT M . bovi s 

18 6 CGCATGTCT^C-GGTGGAAAGC- TTTT- ACGGTGTGGGAT M . av lum 

138 IdGCATGTCTI^GGTC-GAAAGT'-iCTT^CGGTGTGGGAT M. intracellulai 

1 8 6 CGCATGTCTTCTGGTGGAJLAGC-^TTT|-:GCGGTGTAGAAT M . paratuberc . 

1 a 9 9gCATG5ctTGTC-GTGGAJ^G^-^TTTOGCGGTGTC-GGAT M . scrof ulaceum 

2 00 @GCATGTCTTGTGGTGGAAAGQ-TC|TT , rrGCGGTG | C^GGAT M . leprae 
14 8 UgCATGSCTTGTGGTGGAAAG.- -CTTTjljGCGGTGTGGGAT M.kansasil 
14 9 'dGCATGldcTTGTGGTGGAAAG!- - CTTTT3GCGGTGTGGGAT M . gastr i 
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250 260 270 280 

189 (^cCCGCGGCCTATCAGCTT GTTGGTGGGGTGACGGCCT M ■ tuberculosis 



328 GAGCCCGCGGCCTATCAGCTTGTTGGTGGGGTGACGGCCT M . bovi S 

2 24 GSGCCCGCGGCCTATCAGCTTGTTGGTGGGGTGACGGCCT M. avium 

176 GyGCCCGCX3GCCTATCAGCTTGTTGGTGGGGTGA|GGCCT M . intracellulare 

2 24 GGGCCCGCGGCCTATCAGCTTGTTGGTGGGGTGACGGCCT M . paratuberc . 

187 3dGCCCGCGGCCTATCAGCTSGTTGGTGGGGTGA|GGCCT M . scrof ulaceum 

239 ddGCCCX3CGGCCTATCAGCT^TTSGTGGGGT§ACGGCCT M. leprae . 

186 GGGCCCGCGGCCTATCAGCTTGTTGGTGGGGTGACGGCCT M . kansas 1 1 
GGGCCCGCGGCCTATCAGCTTGTTGGTGGGGTGACGGCCT M . gastr i 
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389 A^^CCTCTTkA CCATCGACC-.^.GGTCCGGGTulCjrCTCG(i M. tuberculosis 
528 a A* CCTCTTTCACCATCGACGAAGGTCCGGGTTCTCTCGG M . bov i s 
a 2 a ajLA.CCTCTTT'CACCATCGACGAAGGTCCGGGTT|I]TCTCGG. M . avium 

AAACCTCTTTC-CCa.~CGACGAAGGTCCGGGTTpiTCTCGG M . intracellulare 



a 2 a AAACCTCTTTCACCATCGACGAAGGTCCGGG^TCTAGG M . paratuoerc . 

3 87 AAACCTCTTTCACCATCGACGAa.GGgCA^-CTT^GG M . scrof ulaceum 

4 3 9 aaACCTCTTTCACCATCGACGA.aGGTC^GGG^ATTCTCGG M . leprae 

3 8 6 AA» CCTCTTTCACCATCGACGAAGGTCCGGGTTCTCTCGG M . kansas i i 

3 87 AAACCTCTTTCACCATCGACGAAGGTCCGGGTTCTCTCGG M . gastri 
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1130 1140 1150 1160 

10 69 T'^'^aTnTTfirrag cfc (^GTAATG GTE GGGAC'i.'CG'J. bA^la M . tuberculosis 
12 08 TCTCATGTTGCCAGCACGTAATGGTGGGGACTCGTGAGAG M . bovis 
1104 TCTCATGTTC-CCAGCGC-GTAATGCCGGGGACTCGTGAGAG M . avium 
1056 TCT C A V GTTG CCAG CGGGTAATGCCGGGGACTCGTGAG AG M . intracellulare 
109 8 TCTCATGTTC-CCAGOGGGTAATGCAGGGGACTCGTGAGAG M . paratuberc . 
1064 TCTCATGTTGCCAGbGdGTAATGCGGGGGACTCGTGAGAG M . scrof ulaceum 
1119 TCTCATGTTGCCAGCACGTAATGGTGGGGACTCGTGAGAG M. leprae 
106 6 TCTCATGTTGCCAGOGGGTAATGCCGGGGACTCGTGAGAG M . kansas 1 1 
1067 TCTCATGTTGCCAGCGGGTAATGtdGGGGACTCGTGAGAG M . gastri 



, 1 1 r 

1250 1260 1270 128 0 

1189 CAATGGCCGGTAC^AGGGCTC-CGATGCCG^GGTTAAG M . tuberculosis 

1328 CAATGGCCGGTACAAAGGGCTGCGATGCCGCGAGGTTAAG M. bovis 

1224 CAATGGCCGGTACAAAGGGCTGCGATGCCGTAAGGTTAAG M. avium 

1176 CAATGGCCGGTACA?AGGGCTGCGATGCCGCAGGTTAAG M. intracellulars 

1218 CAATGGCCGGTACAAAGGGCTGCGATGCCGTA^GGTTAAG M . paratuberc . 

1184 CAATGGCCGGTACAAAGGGCTGCGATGCCGGa!aGGTTAA.G M.scrotulaccum 

123 9 CAATGGCCGGTACA^A.GGGCTGCGATGCCGCaIaGGTTAAG M. leprae _ _ 

1186 CAATGGCCGGTACAAAGGGCTGCGATGCCGCGAGGTTAAG M.kansasn 

1187 CAATGGCCX3GTACAAAGGGCTGCGATGCCGCGAGGTTAAG M . gastri 



Figure 2C 
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1290 1300 1310 1320 
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122 9 CGAATC CTTS^ AGCCGGTCTCAGTTCGGA'lEtSGGGTCT M . tuberculosis 

13 6 8 CGAATCCTTA- AAAGCCGGTCTCAGTTCGGATCGGGGTCT M . bovi s 
12 64 CGAATCCTTTT^J^ M . avium 

1216 CGAATCCTTTTAAAGCCGGTCTCAGTTCGGATjTGGGGTCT M. intracellulars 
12 5 8 CGAATCCTTTTAA^CCGGAXTCAGTTCGGAljgGGGGTCT M . paratuberc . 
12 24 CGAATCCTTTTAAAGCCGGTCTCA.GTTCGGATCGGGGTCT M. scrof ulaceum 
12 7 9 CGAATCCTTTTAAAGCCGGTCTCAGTTCGGATCGGGGTCT M. leprae 
1226 CGAATCCTTTTAAAGCCGGTCTCAGTTCGGATCGGGGTCT M.kansasii 

12 2 7 CGAATCCTTT-TAAAGCCGGTCTCAGTTCGGATCGGGGTCT M.gastri 

, 1 i r 

1330 1340 1350 1360 

! ! L L 

1268 GCAACTCGACCCCE3TGAAGTCGGAGTCGCTAGTAATCGCA M . tuberculosis 

14 07 GCAACTCGACCCCGTGAAGTCGGAGTCGCTAGTAATCGCA M.bovis 

13 04 GCAACTCGACCCCATGAAGTCGGAGTCGCTAGTAATCGCA M . avium 

1256 GCAACTCGACCCCATGAAGTCGGAGTCGCTAGTAATCGCA M . intracellulars 
12 98 GCAACTgGACCCA^TGAAGTCGGAGTCGCTAGTAATCGCA M . paratuberc . 
1264 GCAACTCGACCCCGTGAAGTCGGAGTCGCTAGTAATCGCA M . scrof ulaceum 
1319 GCAACTCGACCCCGTGAAGTCGGAGTCGCTAGTAATCGCA M . leprae 

1266 GCAACTCGACCCCGTGAAGTCGGAGTCGCTAGTAATCGCA M.kansasii 

1267 GCAACTCGACCCCGTGAAGTCGGAGTCGCTAGTAATCGCA M.gastri 



Figure 2D 
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168 TGCCC KTCCd GG VG GAAAAGTAGGACACCGCCGAAC M . tuberculosis 

7 9 TGCCCCTCCGGGG - - - TGGAAAAGTAGGgCACCGCCGAAC M . bov i s 
8 1 TGCCGfcSQqGGd - - - TGGAAAAGTAGGACACCGCCGAAC M . phlei 
3 5 9 9 TGCCCAT:TCGC-G - - - - TGGAAAAGTAGGACACCGCCGAAC M . 1 epr ae 
5782 tSCCcSt'-'QCGGG TGGAAAAGTAGGACACCGCCGAAC M . smegmat is 



Figure 3 
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382 GGC-AC-CTGTC.^CCC-AC-CSg-ATCCGA C-C-ATTTCCG.^ M ■ 

3 82 GGGAGCTGTCA-.CCGAGCATTGATCCGAGGATTTCCGAAT M . parauuberc . 

1053 GGG>GCTGTCAACCGAGC^^^ M • tuberculous 

*67 GGGAGCTGTCA?.CCGAG6dTC-GATCCGAGGATTTCCGAAT M.phlex 

3 92 GGGAGCTGTCAACCGAGCGTGGATCCGAGGATTTCCGAAT M. leprae 

i 6 7 GGGAGCTGTCAACCGAGCGTGGATCCGAGGATTTCCGAAT M . gast r ! _ _ 

IlO GGGAGCTGTCAA2CGAC-CC-TGGATCCGAGGATTTCCGAAT M - «nsasii 

2548 5GGAGCTGTCA?XCGA^-CGTTGA^CGAGGAT§TCCGAAT M.smegmatis 
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170 180 190 200 

A62 GAATATATAGG GTGCG - GGAGG[I^ACGCGGGGAAG'1'GAAA M . avium 

1 62 GAATATATAGGGTGCG - GGA.GGTAACGCGGGGAAGTGAAA M . paratuberc . 

1133 GAATATATAGGGTGCG - GGAGGGAACGCGGGGAAGTGAAA M. tuberculosis 

547 GAATATATAGGCGTTG - GGGGGG AACGCGGGGAAGTGAAA M.phlei 

A 7 2 GAATATATAGGGTTCG - GGAGGGAACGCGGGGAAGTGAAA M . leprae 

24 7 GAATATATAGGGTGCG -GGAGGGAACGCGGGGAAGTGAAA M . gas t r i _ 

190 GAATATATAGGGTGCG - GGAGGGAACGCGGGGAAGTGAAA M . kansasil 

2628 GAATATATAGGC^CT-GC^GC^AACGCGGCGAAGTGAAA M.smegmatis 



Figure 4A 
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541 

541 

1212 
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250 260 ; 

G TC»GT*OTC^C(^GCX5ftJ ^ M . avium 

"ci^i^^-^C-CGAA^^E^GGCTAAACCG M . paratuberc 
I^^^^CG^CGG^CA-GGCT^CG M . tuberculosis 

-G^'"^GTAGTGGCGAGCGAAOG7'GGAATA7iGGCTAAACCG H . leprae 
^G^GTGC-CGAGCGAACGCC^^GGCTAAACCG M.gastr^ _ 
-GTA^GTA^TGGCGAGCGAJ^.OGCGGAACATiGGCTAAACCG M . kansasix 



2706 ^AGTAGTK-C(»GCGAACACC^^GGCrAAACSG M.smegtnatis 

— r- , " 1 r 

310 320 



290 300 



578 
578 
1250 
664 



|CATG>C^vc4ik^CCGGG'k^^xTGTGTGTGCGGGGV M . aviun 
^-CATG^ciACCGGGTAGGGGTTGTGTGTGCGGGGT M . paratuoerc 
C A5G - C ATGGG^ - ACCGGGTAGGGGTTGTGTGTGCGGGGT M . tuberculoses 
StG-CATC^C-^ACCCGGT^GGGGTTGTGTGTGCGG^GT M .phlei 



5 9 0 SS- CATG^ cV1aCTa]gGTAGGGGTTGTGTGTGCGG 2 GT M . leprae 
3 65 CAC^S - cATGGGTGACCGGGTAGGGGTTGTGTGTGCGGGGT M . gas tri _ _ 
III CA^G-CATGGGTAACCGGGTAGGGGTTGTGTGTGCGGGGT M.kansasn 

2745 |lATG§CATG*rGATACCGGGTAGGGGTTGTGTGTGCGGGGT M.smegmatis 



Figure 4B 
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330 



340 



350 



360 



617 TQTGG GSTTGAT ATGT CTCAC- OTCT? r CTGGCTGgGGl- C-G M. avium 
617 TGTGGGATTGATATGTCTCAGCTCTACCTCGCTGAGG- GG M . paratuberc . 
12 89 TGTGGGA^GATATGTCTCAGC qCTAC CffiGGCTGAGg-GG M . tuberculos i s 
7 03 TGTGGG5cC^~'jTGTC;-;CAT CGTCCG COGGOGA^GGCAG M . phlei 

629 ' ' ' '~' ~~ 

404 



TGTGGGATTGjGTATGTCTCAjACTCTACCTGGJjTGAGG - GG M . leprae 
TGTGGGATCGATAGGTCTCAGCTCTACCCGGCTGAGG - GG M . ga s t r i 



347 TGTGGC-ATidGATAdG-CTCAGCTCTACqGGGCTGAGG - C-G M . k 



ansasn 



2 7 85 TGTGGGA ; CCTAT:TT7C-CC-'cCTCTACCTGGCTG-:GAC-GG M. smegmatis 



370 
i 



380 



390 
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400 



6 5 6 TAGTCAGAAAG 

6 56 TAGTCAGAAAGT 

13 2 7 gAGTCAGAAAGT 

74 2 TAGTgATAAAGC 

6 6 8 TAGTCAGAAAGT 

44 3 ICAGTCAGAAAGTG 

3 8 6 jqAGTCAGAAAGl 



7 3TC3TGGTTAGCGGAAGTGGCCTGGGAEr M . avium 

3tcgtggttagcggaagtggcctgggac 
.tgtcgtggttagcggaagtggcctgggafi 
agtgtggttaggjgaagtggcctgggaji 
^dcgtggttagcggaaatggcctgggap 
•tcgtggttaacggaagtggcctgggat 
tcgtggttaaIcggaagtggcctgggat 



M . paratuberc . 
M . tuberculosis 
M. phlei 
M . leprae 
M . gastri 
M . kansasii 



2 823 ScAGTGAGA^ATGTTGTGGTTAGCGGAAA M. smegmatis 



410 



420 



43 0 



440 



6 96 ggccq sccgtag; 

6 96 GGCCCGCCGTAGA 
13 67 GG^C^GCCGTAC-.- 
782 



in i 
sue 



.CGGTGAGAGCCCGGTACGCGAAA-ACC M . avium 
.CGGTGAGAGCCCGGTACGCGAAA-ACC M . paratuberc . 
.CGGTGAGAGCCCGGTACGCGAAA-ACC M. tuberculosis 
^GGTGAGAGCCCG^ACQCGAAA-ACg M. phlei 
CGGTGAGAGCCC§GTACGCGAAA-@CC M . leprae 
.CC-C-TGAGAGCCCGGTACG3gAAA-ACC M. gastri 



GGgCpjGCCGTAGT 
7 08 GGCCppCCGTAGA 

4 83 GCTCIjGCCGTAGA m 

4 2 6 GG^cHGCCGTAC-ACC-GTGAGAGCCCGGTACG^GAAA-ACC M . kansasii 



2 863 GG^CTGCCGTAGACGGTGAGAGCCCGGTACGgGAAA- ACC M . smegmat is 



Figure 4C 



20/32 



■ . r 



450 
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735 ^-n^rTP.rr ^ATATC.^.CAC C CC-AC-TAGCAC-CGGUCU M . avium 
7 3 5 CGGCACCTGCCTTATATCAACACCCGAGTAGCAGCGGGCC M . par atuberc . 
1*06 CGGCACCTGCCTAGTATCAATTCCCGAGTAGCAGCGGGCC M . tuberculosis 



820 
747 



TGC^CCpGC^cFCAGG^-TCCCGAGTAGCAGCGGGCC M . phlei 
'iTGG^CCTGCCITGTATCAATTCCCGAGTAGCAGCGGGCC M . leprae 



522 CGGCACCTGCCTTGTA.TCAATTCCCGAGTAGCAGCGGGCC M.gastri _ 
465 CGGCACCTGCCT-GTATCAATTCCCGAGTAGCAGCGGGCC M.kansasn 

2 902 CGA i CGTCTG"TC-- GATGC-TGTT CCCGAGTAGCAGCGGGCC M . smegma t i s 
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570 530 



8 55 g^GGGAATGGTGAAA^GTACCCCGGG^GGG-AGTUAAA-^ M. avium 

855 G AGGGAATGGTGAAA*GTACCCCGGGAGGG-AGTGAAATA M . paratuberc . 

1526 GAGGGAATGGTGAAAAGTACCCCGGGAGGGgAGTGAAAGA M . tuberculosis 

937 GAGGGAATSgTGAAAAGTACCCCGGGAGGG - AGTGAAAgA M.phlei 

8 67 GAGGGAATGGTGAAAAGTACCCCGGGAGGG^AGTGAAATA M . leprae 

6^2 GaGGGAATGGTGAAAAGTACCCCGGGAGGGGAGTGAAASA M.gastri _ 

585 GAGGGAATGGTGAAAAGTACCCCGGGAGGGGAGTGAAAGA M.kansasn 

3 022 GAGGGAATGGTGAA-AAGTACCCCGGGAGGqGAGTGAAAgA M.smegmatis 



Figure 4D 
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8 s a QTACCTC-^AACCC-TGTGCCTACAATCCGTCAGAC-Cqrcc£ M . avium 

8 o I GTACCTGAAACCGTC-TGCCTACAATCCGTCAGAGCCTCCT M . paratuberc . 
1566 GT ACCTC- AAACCGTGTGCCTACAATCCGTCAC-AGCCTCCT M . tuberculosis 
976 GTACCTGAAACCGTGTGCCTACAATCCXBTCAgAGCqC^CT M.phlei 

9 07 GTACCTGAAACCGTGTC-CCTACAATCCGTCAGAGCCTCi 
682 GTACCTGAAACCGTGTGCCTACAATCCGTCAGAGCC^gi 
62 5 GTACCTGAAACCGTGTGCCTACAATCCGTCAGAGCqCTT 

3 062 GTACCTGAAACCC-TGqGC^TACAATCCGTCAGAGCCC^Cg M . stnegmatis 



T M . leprae 
T M. gastri 
T M.kansasii 
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650 660 670 680 

JL_ 1- 



934 



(S 1 G TGT5 GGTGATGGCGTGCCTTTTGA M . avium 

o 34 c _ GTGGGGTGATGGCGTGCCTTTTGA M . paratuberc . 

^606 TTTCCTCTCCGGAGGA G§G?GGTGATGGCGTGCCTTTTGA M. tuberculosis 

1016 CTT GTAGTGGGGTGATGGCGTGCCTTTTGA M.phlei 

o 47 r£rt "GTGGGGTGATGGCGTGCCTTTTGA M . leprae 

722 9 GTGGGGTGATGGCGTGCCTTTTGA M . gastri 

GTGGGGTGATGGCGTGCCTTTTGA M.kansasii 



665 C- 



3102 ACGTGT; 



— i — 

690 



■GTGGGGTGATGGCGTGCCTTTTGA M . stnegmatis 
~i i r 



700 710 720 
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959 »™» T» * ^-^Tr-r^OTf AGQG AfcACGTCGCG> .GGTTAAC M . avium 
9 59 AGAATGAGCCTGCGAGTCAGGGACACGTCGCGAGGTTAAC M . paratuberc . 
1646 AGAATGAGCCTGCGAGTCAGGGACATGTCGCAjAGGTTAAC M . tuberculosis 
104 6 AGAATGAGCCTC-CGAGTCAGGGACMGTCGCGAGGTTAAC M.phlei 
972 AGAATGAGCCTGCGAGTCAGGGACATGTCGCGAGGTTAAC M. leprae 
7A 7 AGAATGAGCCTGCGAGTCAGGGACATGTCGCGAGGTTAAC M . gastri 
6 90 AGAATGAGCCTGCGAGTCAGGGACATjGTCGCGAGGTTAAC M . kansas i 1 

3132 AGAATGAGCCTC-CC-AGTCAGGGACAgGTCGCGAGGTTAAC M.smegmatis 



Figure 4E 
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1033 CG bATCCCCTTTGGG GTGt TAGTGGCGTGT M. avium 

1039 CGCATCCC^TTTGGG GTGTAGTGGCGTGT M.paratuberc. 

1726 CGA05cgA^GC AfACG CGCGf^ M. tuberculosis 

112 6 CGTATCCAACCTGTT gGGGTTd GTGTAGTGGgGTGT M.phlei 

1052 CgMaTCScGTGTGAGCCT GTGTAGTGGCGTGT M. leprae 

82 7 CGTATCycteCGTAAC-CGT GTGTAGTGGCGTGT M . gas t ri 

770 COTATcHcbcGCG^GCGlj GTGTAGTGGCGTGT M.kansasii 

3 212 CGTATCCACACA^GAGTGTGTG GTGTAGTGGCGTGT M.smegmatis 
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1050 1060 1070 1080 
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13 07 ruarr £ A af?TCCGAATGCC b - TGGTG - TAAAAGCJ 3TGGCA M. avium 
1307 CAGCCAAACTCCGAATGCCG - TGGTG - TAAAAGCGTGGCA M.paratuberc. 
2005 CAGCCAAACTCCGAATGCCG - T GGTG - TAAAAGCGTGGCA M. tuberculosis 
1401 C AG C CAAACT C CG AATGC CGATAAG -j - TgAAAGTjGTGGCA M.phlei 
1323 CAGCCAAACTCCGAATGCCG -TGGTlj- TAAAAGCGTGGCA M. leprae 
1038 CAGCCAAACTCCGAATGCCG - TGGTG - TATjApjGCGTGGCA M.gastri 
1041 CAGCCAAACTCCGAATGCCG-TGGTG-TATaUgCGTGGCA M.kansasii 

34 8 6 CAGCCAAACTCCGAATGCCGGTj^GGCCAAGAG^GCGGAA M . smegma t i s 
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1170 1180 1190 1200 

i i i u 

14 2 5 AGTGGAAAAGGATGTG ITAGTCGCK^ -GACAACCAGGAGG M . avium 

1425 AGTGG AAAAGG ATGTGTAGT CG C AG A - G ACA-AC C AGG AGG M.paratuberc. 
2122 AGTGGGAAAGGATGTGCAGTCGCAAA-GACAACCAGGAGG M . tuberculosis 
1519 AGTGGAAAAGGATGTQCAGTCGCqGaSgACAACCAGGAGG M . phle i 
1441 AGTGGAAAAGGATGTGCAGT CG CAAA - GACAACCAGGAGG M . leprae 
1215 AGTGG^AAAGG ATGTGCAGTCGCAGA - GACAACCAGGAGG M.gastri 
1158 AGTGC^AAJIGGATGTGCAGTCGCAGA- GACAACCAGGAGG M.kansasii 

3 60 6 AGTGGAAAAGGATGTGAkGTCGCAGA^GASAACCAGGAGG M . smegma t i s 



Figure 4F 
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1250 1260 1270 1280 

15 04 CTCACTGGTCAAGTGATTIATGCGCOGATAATGTAGCGGGG M. avium 

15 04 CTCACTGGTCAAGTGATTATGCGCCGATAATGTAGCGGGG M.paratuberc . 

2201 CTCACTGGTCAAGTGATTGTGCGCCGATA-ATGTAGCGGGG M . tuberculosis 

15 9 8 CTCACTGGTCAAGTGATt|gTGCGCJ;GATAATGTAGCGGGG M . phle i 

152 0 CTCACTGGTCAAGTGATTGTGCGCCGATAATGTAGCGGGG M. leprae 

12 94 CTCACTGGTCAAGTGATTGTGCGCCGATAATGTAGCGGGG M . gastri 

1237 CTCA.CTGGTCAAGTGATTGTGCGCCGATAATGTAGCGGGG M.kansasii 

3 6 86 iTTCACTGGTCAAGTGATTGTGCGCCGATATiTGTGGCGGGG M.smegmatis 



i i i r 

12S0 1300 1310 1320 

! • ' _ 1 

1544 PTCAAGCACACCGCCGAAGCCGCGGCACAT lTCATCTT-TA M. avium 

1544 CTCAAGCACACCGCCGAA-GCCGCGGCACATTCATCTT - TA M . paratuberc . 
2241 CTCA-AG C AC AC C 3 C C G AAG C C G CGG C A CATg C AC CTTGTpj M . tuberculosis 

16 3 8 CTCAJ\GCACACCGCCGAAGCCGCGGCAi53 T ^jR^^ , ^ M . phi e i 

156 0 CTCAAGCACACCGCCGAAGCCGCGGC ACATTC ACCTTjGTA M. leprae 

1334 CTCAAG CA CAC CG CCGAAGC CGCGACA ^jCCpq-PjA M. gastri 

1277 CTCAAGCACACCGCCGAAGCCGCGACa ! ; A;qCffg-j-;A M.kansasii 



3 72 6 [PTC AAG CAXTAC CG CCG AA.GC CG CGGA A'- - G C C AA OQ Tgj Tg M . smegma t l s 

j 1 1 r 

1330 1340 1350 1360 

' 1 _ I L. 

1583 CGGTGGAl TGTGGGTAGGGGA-GCGTlCCCgCATTCAGCGAAG M . avium 

15 8 3 CGGTGGATGTGGGTAGGGGAGCGTCCCCCATTCAGCGAAG M . paratuberc . 

22 8 0 ^GGTGGGTGTGGGTAGGGGAGCGTCCCTCATTCAGCGAAG M . tuberculos i s 

1676 TGG^GyTGTGGGTAGGGGAGCGTCQTC^CATgqgGgGAAG H.phlei 

160 0 GGGTGGATGTGGGTAGGGGAGCGTjTCCTCATTCAGCGAAG M. leprae 



1367 



AGGTi iTGGGTAGGGGAGCGTCCOIjCATTCAGCGAAG M . gastri 



1310 jAGGTj |TGGGTAGGGGAGCGTCCqijCATTCAGCGAAG M . kansasi i 



3 764 p?T |TGGGTAGGGGAGCGTCCyG-ATfCQGGTGAAG M . smegma 1 1 s 



Figure 4G 
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1370 1380 1390 140 0 

-,623 i rTrf^GGGTGACCGGTGGTGGAGGG TGGGGGAGTGAGAAT M. avium 
llll CTCCGGGTGA'^'CGG r i"GGTGGAGGGTGGGGGAGTGAGAAT M . paratuberc 

2 19 cSc^TGA^^ M. tuberculosis 

^cSStg^cc^tc^tggagc^tc^gagtgagaat M.phlex 

Hit ctTCCGGGTA^A^CGGTGGTGGA.GGGTGGGGSAGTGAGAAT M leprae 
11 02 cicGCCGGGTGACCGGTGGTGGA.GGATGGGGGAGTGA.GAAT M.gastn 
1345 CT^CCGGGTC-ACCGGTGGTGGAGGATGGGGGAGTGAGAAT M . kansasu 

3 7 96 CCGCCGSGTA_T03AGiGGTGGAGGG l GJGGC-A^ . i~ -v.«. a 



1530 



1 " 1 ' 

154 0 1550 1560 



1781 CGATGGACAACGC-GTTC-ATATTCCC GTACCCGTG'ilATm M ■ avium 

llll CGATGGACAACGGGTTGATATTCCCGTA.CCCGTGTATGGG M . paratuberc . 

?R75 cGATGGACAACGGGTTGATATTCCCGTACCCGTGTATGAG M.phlex 
180 0 ^JS^^TTGATATTCCOGTACCCG^TG^ M . leprae 
llll S^C^C^TT<aTATTCCCGTACCC»TGT|G^ M.gastn 
1505 CGATGGACaI^ 

3 956 CGATGGACA^CGGGTTGATATTCCCGTACCCGTGTATGT^ M. smegmatis 

, I > ■ r n 

1570 1580 1590 160 ° 

, sn ■cgTCCCTGaI^^ATCA-GCC-GTA CTAACcAcCCAAAACCG M. avium 
llll StStS-^TCA-GCGGTACTAACCACCCA^A-CCG M. paratuberc 
2519 35K^ M . tuberculoses 

llll CGTCCCTGATGAATCT^A^rr^TgCTAACCACCCAJ^J^CCQ M.phlei 
llll ^cS^TGA^TC^^TA^CCACCCAAAACCXS M . leprae 
^602 CGGCCGTGATGAATCA-GCGGTACTAACCA.CCCAAAACCG M.gastrx 
1^45 C^CC^TGATGAATCA-GCGGTACTAACCACCCAAAACCG M.kansasix 

3996 GGTCCaTGATGAATCA-GCGGTACTAACCAgcCAAAACCa M . smegmatis 

Figure 4H 
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1610 



162 0 



13 6 0 GAT - CG A C CAT - 
18 6 0 GAT 
2558 GAT 
1355 GgC 
18 7 9 GAT 
1641 GAT 
1584 GAT 



1630 



1640 
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CGACCAT-T 
CGATCACj-I 
CGATq r -AI 
CGACCATA7 
CGATCAC 



pCCCTTCGGGGC-EEGTGGCGATTZQGG M . avium 
tcCCTTCGGGGGC-GTGGCGATT-CGG M . paratuberc . 
■CCCCTTCGGGGC^GTGGgGjljlg^GG M . tuberculosis 

— ~~ ' GTGACGqTTiGjGG M.phlei 

CC^TTCGGGG^^TGGAGjGTT-CGG M. leprae 
■CCCCTTCGGGGG^-GTGG^Gj^T]Cj-[IjGG M . gastri 



CGATCA;C- TCCCCTTCGGGGGC - GTGGA^G^Tjcj ~ ^IjGG M.kansasii 
4 03 5 jACCGjGACCfeCAC'CT - -;TTCGGGG^^GTGGCGjjljGGTGG M . smegma t i s 



1650 
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1896 f^PTr,rr,Tr-r j~-5r.CTTCGCi rC-GTAGTAGTCAAGC5S33GG M . avium 
1 8S6 GGCTGCGTGGG.-.CCTTCGCTGGTAGTAGTCAAGCAATGGG M . paratuberc . 
259A GGCTGCGTGGGAACTTCGCTGGTAGTAGTCAAGCpAgGGG M . tuberculosis 
1 S 8 6 GGCTGCGTGGGACCCG^GTqGGTAGTAGTCAAGqGATGC-G M . phi e i 
1917 GGCTGCGTGGGAACTTCGfETGGTAGTAGTCAAGqGATGGG M . leprae 
1677 GGCTGCGTGGAC-CCTTCGCTGGTAGTAGTCAAG^GATGGG M . gastri 
162 0 GGCTGCGTGGAGCCTTCGCTGGTAGTAGTCAAGCjGATGGG M . kansas 1 1 



4 071 GGCTGCATGGGACCTTCG^TGGTAGTAGTCAAGCGATGGG M . smegmat is 



• • • 



, , ' r 

1730 1740 1750 1760 

197 a - r Tfyyysr a k r -c rttaTAG - - AG AGCGATAGGC AAATCCGT M . avium 

1974 - CTGGGGCAAGCCCGTAG - - AGAGCGATAGGCAAATCCGT M . paratuberc . 

2 6 7 2 - CTGGGGCAAGCCGGTAGGGAGAGCGATAGGCAAATCCGT M - tuber culos i s 

2 063 -C^GGGGTAJ^.CCTGTAGGGGGAG^GATAGGCAAATCCGT M.phlei 

1 9 9 5 - CTGGSGCAAGCCCGTAGbGAGAGCGATAGGCAAATCCGT M . 1 eprae 

1755 -ctggggcaagccagtagggagagcgataggcaaatccgt M. gastri 

1698 -CTGGGGCAAGcdAGTAGbGAGAGCGATAGGCAAATCCGT M . kansasii 
4 14 9 - OgGdgGgAAGCCaC-TAGjGC^.GTC^GATAGGTAAATCCGT M . smegma t i s 



Figure 41 
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1850 I860 1370 1880 

2 089 GCGAGC ffiCTTSC^ -C-CCCGTACCCCAAACCAACACAGGT M . avium 
2 0 8 9 GCGAGCACATACACTGCCCGTACCCCAAACCAACACAGGT M . par atuberc . 
2 789 GCGAGCACAgACACGC-CCCGTACCCCAAACCgACACAGGT " M . tuberculos i s 
217 9 GCSAGCgCATACACGGCCCGTACCCCAAACCAACACAGGT M . phlei 
2112 GCGAGCATA^A^CGGCCCGTACCCCAAACqGACACAGGT M. leprae 
1872 GCGA.GCACA.jcACACGGCCCGTACCCCAAAXCjGACACAGG M. gastri 
1815 GCGAGCACA|qACACGGCCCGTACCCCAAACqGACACAGGT M . kansasii 



4 2 66 GCGAGGACATACACGC-CCCGTACCCCAAACCAACACAGGT M. smegmatis 



• • • 



, ! 1 r 

1970 1980 1990 2000 

■ i l 

2 208 AGGGGG t2CCGC-AATACC GTGAACACCCTTGCGGTGGGAGC M. avium 

22 08 AGGGGGCCCGGAATACCGTGAACACCCTTGCGGTGGGAGC M . par atuberc . 
2 90 8 AGGGGGSCCGGAATATCGTGAACACCCTTGCGGTGGGAGC M . tuberculos i s 
2 2 98 AGGGGGAC CgACG TACCGTGAGGGCTCTTG CGGgGGgAGC M . ph 1 e i 
2 231 AGGGGGG'cCGGAATATCGTGAACACCCTTGCGGTGGGAGC M. leprae 
1910 '- 1 M. gastri 

1934 AGGGGG^CCGGAATACCGTGAACACCCTTGCGGTGGGAGC M . kansasii 

4385 AGGGGG^CCjCACAI^CGTGTA^-C(^TTA^CGG£^^GC M. smegmatis 



_ , , 1 r 

2010 2020 2030 2040 

2248 GGGATE£GS3CGCAGAAACCAGTGE3AGCGACT-GTTTA M. avium 

2 24 8 GGGATTCGGCCGCAGAAACCAGTGGGTAGCGACT- GTTTA M . paratuberc . 

2 94 8 GGGATgCGGSqCGCAGAAACCAGTG^Gj^AGCGACT-GTTTA M . tuberculosis 

2 338 GGG^^GGjrGC-CACAAACCAGTGAGGAGCGACT - GTTTA M . ph 1 e i 

2 271 GGGATCCGG2CC-CAGAGACCAGTGAGAAGCGACT - GTTTA M . 1 epr ae 

1910 " " M. gastri 

1974 GGGATTCGGgCGCAGAAACCAGTGgG^AGCGACTgGTTTA M . kansasii 

442 5 G^GA^ TtiGGTGGCAiCAAACCAGTGAG'AAGCGACT - GTTTA M.smegmatis 



Figure 4J 
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2130 2140 2150 2160 

23 S7 CCGTTAACCCESr -AAGGGTGAAGCGGAGAATTTAAGCCC M . avium 

2 3 67 CCGTTAACCCGT - -AAGGGTGAAGCGGAGAATTTAAGCCC M . par atuber. 

3 0 67 CCGTTAACCCGC- -AAGGGTGAAGCGGAGAATTTAAGCCC M. tuberculO! 
2*57 CCGTTAA CCCTTTCGC-GGGTC-AAGCGGAGAATTTAAGCCC M . phlei 

23 SO CTGTTAACCCGA-TAAGGGTGA.-.GCGGAGAATTTAAGCCC M. leprae 
1910 " " M.gaatri^ 

2 0 94 CCGTTAACCCGC- -AAGGGTGAAGCGGAGAATTTAAGCCC M . kansasii 

4 544 C CGTTAACCC?CC-TC-3GGGTGAAGCGGAGAATTTAAGCCC M . smegma t is 



• • • 
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2260 2270 2280 
_j_ — i L 



24 8 5 GTAACGACTTCESS^TGTCTCAACCATAGACTCGGCGAA M . avium 
24 85 GTAACGACTTCCCAACTGTCTCAACCATAGACTCGGCGAA M . paratuber c . 
3185 GTAACGACTTC7CAACTGTCTCAACCATAGACTCGGCGAA M . tuberculosis 
2 5 7 7 GTAACGACTTCTCAACTGTCTCAACCATAGACTCGGCGAA M . phlei 
2508 GTAACGACTTCTCAACTGTCTCAACCATAGACTCGGCGAA M. leprae 
19 10 ~ M.gastri 

2212 GTAACGACTTCTCAACTGTCTCAACCATAGACTCGGCGAA M. kansasii 

4 6 63 GTAACGACTTCTCAACTGTCTCAAQ -'ATAGACTCGGCGAA M . smegma t i s 



Figure 4K 
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2605 GTTCGGTACC-GTTTC-TC-TAC-C-ATAC- C-TC-C-C-AC-AC'iiri^: M ■ avium 
2 60S GTTCGG T ACGGTTTGTGTAGGA7AGGTGGGAGACTTTGAA M . paratucerc 

ol OT^AOGGTTTGTGTAGGATAGGTGGGA^Cl^ H tuberculosa 
2 6 97 GGTCGATA.CGGTTTGTGTAGGATAGGTGGGA.GACTGTGAA M . phlei 
269.8 GTTCGG^CGGTTTGTGTAGGATAGGTGGGAGACTCTGAA M . leprae 

2332 GTTCGGTACGGTTTGTC-TAGGATAGGTGGGAGACIPTGAA M • kansasii 

4782 G;GTCGATACGGTTTGTGTAGGATAGGTGGGAGA.CTpTGAA M . smegmat is 



2410 2420 24 3 0 244 0 

, W c ■ c.cacAl-^CGCC^GT^rGT^ -GAGTCGTTGTTC-AAATACC M. avium 
2 6*5 ^CaSS-ACGCcVgTTT^GGAGTCGTTGTTGAAATACC M.paratuoerc. 

Ills ^^^C^TGCmP^^^^^ 0 «'^« CUlOBX8 
2737 GcbGGGACGCCAGTTCC-b'GTGGAGTCGTTGTTGAAATACC M.phlei 

2668 ^C T TCG a CGC^'AGTTxiGGGTGGAGTCGTTGTTG AAATACC M . leprae 
z ° oa Q^cjL^-*- - - - M.gastri 

2372 ScpfcSACGCCAGTTGGGGTGGAGTCGTTGTTGAAATACC M. kansasii 
4 822 GCTCAGACGCCAGTC-TGC-GTGGAGTCGTTGTTGAAATACC M.smegmatis 

, ■ 1 "I 

2450 2460 24 7 0 248 0 

2685 ACTCTGATCGTATTC-C-ACACCTAAC GTCC-AACCfeT^TC M. avium 
Ills aScTGATCGTATTGGACACCTAACGTCGAACCCT^ATC M.paratuberc 
Ills ACTCTGATCG Ti TTGGGCATCTAJ^c5TCGAJ^CCCTCAATC M . tuberculosis 
2 3 7 AcScGTATTGC^ M.phlex 
2708 ACTCTGA^GTATTGAACA^CTAACCTCGAACCgTATATC M.lepr.e 

2412 ACTCTGATCGTATTGGACACCTAACGTCGAACCCIpAATC M. kansasii 

4862 ACTCTGATCGTATTGC^CSCTAACCTCGgACCgTgTATC M.smegmatis 



Figure 4L 
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2 92* c^TGTC^TCa^ GC-ATAAAAC-GTACCCCC-TOGATAA^ M . avium 

232* GGTGTCACTCAACGGATAJLAAGGTACCCCGGGGATAACAG M . paratuberc . 
^ 625 GGTGTCCCTC^CGGATA^GGTACCCCGGGGATAACAG M . tuberculosis 

3 017 GG^GTC^CTCAACGGATAAAAGGTACCCCGGGGATA-ACAG M . phi e l 

2 9 a 8 GGTGTC^CTCAACGGATAAAAGGTACCCCGGGGATAACAIG M . leprae 
L -' M.gastn 
2652 GGTGTCGCTCAACGGATAAAAGGTACCCCGGGGATAACSG M . kansasii 

5102 GGTGTCGCTCAACGGATA-AAAGGTACCCCGGGGATAACAG M . smegtnat is 



• • • 



, i r- r 

2810 2820 283 0 284 0 

30 *a GGTCCCA^-GTTGGC-CTGTTCGCCC -ATTAAAGCGGCAU M . avium 
loll GGTCCCAAOGGTTGGGCTGTTCGCCC-ATTAAAGCGGCA.C M. paratuberc 
3745 GGTCCCAAyC-GTTGGC-CTGTTCGCCC-ATTAAAGCGGCAC M. tuberculosis 
3137 GGTCCCAAGGGTTGGGCTGTTCGCCC-ATTAAAGCGGCAC M.phlei 
3068 GGTCCCAAyGGTTGGGCTGTTCGCCC-ATTAAAGCGGCAC ^^ae 

2772 GGTCCCAAGGGTTGGGCTGTTCGCCC-ATTAAAGCGGCAC M. kansasii 

5222 GGTCCCAAGGGTTGGGCTGTTCGCCCCATTAAAGCGGCA.C M . smegmatis 



Figure 4M 
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3050 3060 3070 3080 

3283 C^VG^TCAGC-TTTC-TCACCCttlTI^^-GGATAAC-GCCC M . avium 
3 2 83 CAAGATCAGGTTTC-TCACCCTTTTAGAGGGATAAGGCCC M . paratuberc . 
2SBA CA a GATCAGGTTTC - TCACCCACTTGGJGGGATAAGGCCC M . tuberculosis 
3 3 7 6 CAAGAGCAGGCTTC- TCACCCTCTAqC-AGGGATAAGGCCC M . phlei 
3307 CAA " " M. leprae 

T 9 i0 __ M.gastri 

3 011 CAAGATCAGGC7TC - TCACCCACTTGG^GGGATAAGGCCC M . kansasii 

54 62 CAAG AjC CAGGCT 7 C - 7CACCC7CTAGGAGGGATAAGC-CCC M . smegma t i s 

, . ' — r 

30SO 3100 3110 3120 

, I L. 

3 322 rrmn - ar.a CGGC ^aTTgATAGGCI JAGACCTGGAAGCT M . avium 

3 322 CCCGC- AGAf CACGC-GATTGATAGGCCAGACCTGGAAGCT M . paratuberc . 

4 023 CCCGC -AGAA'cJ-.CGGCTTCAATAGGtCAGACCTGGAAGCT M . tuber cu los i s 
3 a 15 cCCGC-AGACCACGGGAT^-ATAGACCAGACCTGgAgGCg M . phlei 

3309 " M. leprae 

1910 " M.gastri^ 

3 050 CCCGC -AGASCACGGG^TjCGATAGGCCAGACCTGGAAGCT M . kansasn 

5501 CCCGC - AGACCACGGGATTGATAGACCAGACCTGGAAGCQ M . smegmatis 



Figure 4N 
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130 140 150 ISO 

r^CTaacACG^-C-Gi^TCTGCC CTC-CACTTC-GGGA-x-AA M . avium 

r^G^CACG^GGGC^TCTACCCTGCACTTC-GGGAT^ M . paratuoerc . 
CAGT AACACG'^GGGCAATCTGCCCTGCACTTC - GGGATAA M. scrof ulaceum " 



G^GTAACACGTGGGTGATCTGCCCTGCACTTC- GGGATAA M . tuberculosis 
209 GAG^AACACGTGGGTGATCTGCCCTGCACTTC- GGGATAA M.bovis 
GACTAACACGTC-GGTXATCTGCCCTGCACTTCSGGGATA.A M . leprae 

gaStSScgtgggcaatctg^ 



^^S^^^CCTOCACyc-GGGATAA M . gas tri 
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^50 460 470 480 



a2 a ^acCTCTTTCACCATCGACGAA C^TCCGGGtmaCTCGG M ■ avium 

3?6 ticCT^CACCATCGACGAAOTCCCXSGTTTTCTCGG M . intracellulare 

4 2* AAACCTCTTTCACCATCC-ACGA.AGGTCCGGGTTTTCT?.GG M.paratuberc. 

387 AtACCTCTTTCACCATCGACGAAGGCTCE3 TT 223 GG M . scrorulaceum 

3 89 A^CCT^ M . tuberculosis 

528 AAACCTCTTTCACCATCGACGAAGGTCCGGGTrgTCTCGG M.bovis 

43 9 AAACCTCTTTCACCATCGACGAAGGTC|GGGA^TTCTCGG M. leprae _ 

386 AAACCTCTTTC^CCATCGACGAAGGTCCGGGTT;GTCTCGG M.kansasil 

3 87 AAACCTCTTTC^.CCATCGACGAAGGTCCGGGTT'gTCTCGG M.gastri 



• • • 
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1130 1140 H50 116 0 

1104 TCT ™™^™r- rtW^ M. avium 

Hie TCTCATGTTGCCAGCGGGTAATGCCGGGGACTCGTGAGAG M . intracellulare 

109 8 TCTCATGTTGCCAGCGGGTAATGCAGGC-GACTCGTGAGAG M.paratuberc. 

1064 TCTCATGTTGCCAGCGGGTAATGCCGGGGACTCGTGAGAG M . scrof ulaceum 

106 9 TCTCATGTTGCCAGCAGGTAATGG^GGGGACTCGTGAGAG M . tuberculosis 

1208 TCTCATGTTGCCAGCAqGTAATGbTGGGGACTCGTGAGAG M.bovis 

1119 TCTCATGTTGCCAGCACGTAATGgTGGGGACTCGTGAGAG M . leprae _ _ 

1066 TCTCATGTTG CCAG CGGGT AATG CCGGGG ACTCGTGAGAG M.kansasil 

1067 TCTCATGTTGCCAGCGGGTAATGCCGGGGACTCGTGAGAG M.gastri 



Figure 5A 



32/32 



1290 1300 131 0 132 0 

126* cGAATfl SKOSSf^ M. avium 

lilt CGA^T'CCTT^T AAAG C CGG ACTCAGTTCGGATTGGGGTCT M . paratuperc 
122! CGA^^CCT^T~-J^_-.GCCGGTCTCAGTTCGGATi^GGGGTCT M . scrorulaceum 
Itll CGA^TCCTTA^AAAGCCGGTCTCAGTTCGGAl|clGGGGTCT M. tuberculosa 
'368 CG^CCTTA-.^X-CCGGMCTCAGTTCGGA^ M.bovis 
1 279 CG A^TCC^T^ AAAG C CGG^CTC AGTTCGG ATjCGGGGTCT M. leprae 

\lll CG^CCTTTT^GCCGgMcTCAGTTCGGAT^GC^ M.gastrx 



1330 1340 1350 1360 

nnA CCA ACTCGACCCd^'GAJ^GTCGGAGTCG CTAGTAATCCjUA. M. avium 
lilt GCAACTCGACCCCATGAAGTCGGA.GTCGCTAGTAATCGCA M . intracellulars 
^2 98 ^S^CCC^TGAAGTCX^TCGCTAGTAATCGCA M.paratuberc 
III. GCAACTCGACCCCGTGAAGTCGGAGTCGCTA.GTAATCGCA M . scrorulaceum 
lilt ^S^CCC^TGAAGTCX^GTOSCTAGTAATCGCA M . tuberculosa 

GCAACTCGACCCOGTGAAGTCGGAGTCGCTAGTAATCGCA M.bovis 
All gCAACTCGACCCOcWaAGTCGGAGTCGCTAGTAATCGCA M. leprae 
Ills GCAACTCGACCCCGTGAAGTCGGAGTCGCTAGTAATCGCA M.kansasia. 
1267 gCAACTCGACCCQSTGAAGTCGGAGTCGCTAGTAATCGCA M.gastr, 



Figure 5B 



